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(54) Channel estimation 

(57) The invention discloses a method for under-pa- 
rameterizing the channel estimate in UMTS/GSM radi- 
omobile Base Station Receivers exploiting M-antennas 
either in space diversity or closely-spaced. A Least 
Square (LS) space-time channel matrix H LS is firstly es- 
timated for K active users and then spatially and tempo- 
rally pre-whitened with a spatial noise covariance matrix 
R ns and a midamble correlation matrix R XJP respectively. 
The whitened elements of H LS are so describable by 
means of a relevant statistical distribution of the ampli- 
tudes. In time slotted GSM/CDMA systems the spatial 
noise covariance matrix R ns is averaged on NS succes- 
sive time slots, depending on the stationariness of the 
channel and the permissible mobile speed. A suitable 
classification variable and a respective classification set 
are both determined either for single-slot or multislot sta- 
tistical sample. A discriminating threshold S which min- 
imizes the probability of uncorrect classification is iter- 
atively estimated, together with the statistical parame- 
ters of the useful and noise subset, staring from an ar- 
bitrary initial probability an element belong to the useful 
subset. Rayleigh, Exponential, Chi-squared, Erlang, 
and Gamma distributions are all usable. Classification 
generates a space-time mask matrix which is empirical- 
ly adjusted to perform deep-temporal merging through 
the NS multislot and spatial merging through the M an- 
tennas. By effect of the final mask application the chan- 
nel matrix H LS assumes a band configuration H S7B by 
which if a predetermined percentage of the useful ele- 
ments relevant to all the M antennas at the same sam- 
ple-time is greater than a fixed value, i.e. 30%, depend- 



ing on the distance between two adjacent antennas, all 
the elements relevant to said sample-time are taken as 
useful too while the remaining are zeroed. A suitable 
Base Station receiver exploits the band channel matrix 
H stb {0 jointly ZF-BLE/MMSE detect the transmitted da- 
ta (fig.39). 
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Description 

FIELD OF THE INVENTION 

[0001] The present invention is referred to the field of the advanced cellular telephony systems and more precisely 
to a band masking method of space-time channel matrices in mobile communication systems. 

BACKGROUND ART 

[0002] Background art is considered in view of the aim of the present invention which is that to achieve complexity 
reduction in channel estimation. As known, Maximum Likelihood (ML) Channel estimation is performed by minimizing 
Mean Square Error (MSE) between the observed variables in correspondence of the reception of a training sequence 
and those calculated, under certain hypotheses on the noise statistics, by using a channel model in conjunction with 
said training sequence stored in the receiver. A complexity reduced receiver in comparison with a full complexity re- 
ceiver shows improved channel estimation and a simplified subsequent data detection. An increase of the complexity 
in channel estimation is mainly due to the attempts of better exploiting the intrinsic physical behavior of the channel, 
specially in complex systems like the most recent proposed ones based on Code Division Multiple Access (CDMA) 
techniques. Mostly of the traditional GSM Base Stations (BSs) exploit tri-sectorial antennas for allocating in each sector 
an user at a time on a single channel characterized by Frequency Division Multiple Access (FDMA) and Time Division 
Multiple Access (TDMA), with Frequency Division Duplexing (FDD) for two-way communications. Due to the dispersive 
length Woi the temporal channel, W samples of the user baseband response shall be considered in the estimation 
and the complexity is about of W unknown parameters to be estimate. A net improvement in channel estimation in 
presence of inter-cell interferents has been obtained in uplink by supplying the Base Stations (BSs) with directive arrays 
able to exploit spatial other than temporal dimension (the so-called smart antennas). The improvement is mainly due 
to a beamforming suitable for detecting and rejecting cochannel interferents coming from the neighbouring cells. An 
alternative way to improve the channel estimation and data detection is that to exploits set of non-closely spaced 
antennas (spaced more than A/2); this arrangements is known as space diversity reception and the improvement is 
due to the uncorrellated fading on the various signals which are opportunely summed up by the receiver. In comparison 
to the traditional GSM receivers the ones equipped with set of M antennas have to estimate MW unknown parameters, 
and the complexity increases by M times consequently. 

[0003] Third generation Universal Mobile Telecommunication Systems (UMTS) add up further improvement in data 
transmission; in this systems the maximum permissible bit-rate is increased by a trade-off between the bit-rate and 
the number Kof simultaneous users allocated on the same CDMA channel. As known, CDMA technique is implemented 
convolving each oversampled data bit with a pseudo-noise spread sequence, Q-chips long, temporarily assigned by 
the network to both a so-called Resource Unit RU (i.e. the Mobile Station MS) and the BS for transmitting and receiving 
packet data. Ideal spread sequences are each other orthogonal, that means a generic sequence is perfectly recovered 
at the output of a matched filter, while the same sequence generates noise at the output of a filter matched to another 
sequence. The original frequency band of data symbol is enlarged to cover the overall transmission band, and the 
original information scattered consequently. The orthogonal property has to be also possessed from the midambles 
which are not subjected to spreading code but nevertheless shall make the Kusers each other distinguishable in uplink, 
as will be detailed later on. As far as concerns the channel estimation in UMTS Base Stations supplied with traditional 
antennas, the complexity of the estimation is about KW unknown parameters to be estimated during the burst time, 
while for BS supplied with set of M antennas the complexity increases to MKW unknown parameters. 
[0004] Valid background art which clarifies, if not otherwise provided for, that the increased complexity has adverse 
effect on channel estimation and successive data detection is constituted by the article [REF. 1] of B. Steiner, P. Jung, 
titled: "Optimum and Suboptimum Channel Estimation for the Uplink of CDMA Mobile Radio Systems with Joint De- 
tection", Eur. Trans. Telecommun. Related Technol. (ETT), Special Issue on "Multiple access in radio communications 
networks", Vol. 5, pp. 39-50, 1 994. The cited article says that as far as concerns midamble sequences and consequent 
channel estimation in CDMA systems, because of point-to-multipoint transmission in downlink direction (BS versus 
RUs) and multipoint-to-point transmission in uplink (RUs versus BS) an asymmetric scenario is recognizable at the 
on-air interface. More precisely, a unique midamble has to be transmitted in downlink towards the K'RUs, while the K 
RUs simultaneously transmitting in uplink (not necessarily equal to K) must use K different midambles, each other 
orthogonal, in order to be recognizable at the BS after preliminary synchronization. In that the channel estimation task 
is more expensive at the Base Station side and can be further complicated by introducing a set of antennas for the 
aforementioned reasons. The article of [REF. 1] discloses that the K orthogonal midambles for uplink are obtainable 
through as many phase shifts of a unique periodic sequence having period P=KW chips, where W \s the maximum 
significant time dispersion of the estimating channel. Obviously the longer is the midamble the shorter is payload, 
assuming the burst length unchanged, and spectral efficiency is poorer consequently. 
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[0005] Being the complexity reduction the aim of the present invention, the uplink channel estimation is only consid- 
ered in the following disclosure. Complexity reduction can take advantage from powerful DSP processors included in 
the BS receivers. The article of [REF. 1 ] points out that that the ML estimation is unbiased because the channel model 
is full-dimensional, to say it includes all the unknown parameters which are needed to describe the channel without 

5 any a-priori hypothesis on the channel structure. The price to be paid for the unbiasedness of the estimation in com- 
parison with the biased estimation performed by a filter matched to the channel is an S/N degradation of the estimated 
elements. Obviously the channel estimation is only possible when the number U of unknown parameters (the channel 
coefficients of the FIRs modeling the channels) does not exceed the number N of the chips of the midamble sequence. 
[0006] A biased channel estimation algorithm shall exploit a subset only of the full-dimensional channel parameters, 

10 avoiding the redundancy of the former model and allowing a noiseless estimation under contained increasing of the 
error in the estimated parameters. The theoretical possibility to disclose a channel with a reduced set of useful param- 
eters is inferable from the channel models described in the Recommendation GSM 5.05 issued from European Tele- 
communication Standard Institute (ETSI). These models are nevertheless only temporal and represent the channel by 
means of a certain number of time discretized Channel Impulse Responses (CIRs). Most recent models which best 

J5 support beamforming algorithms are referred to the COST-259 Directional Channel Model (DCM). This is a stochastic 
channel model for Space-Time (S-T) systems that includes both azimuthal and temporal dispersions. The aforemen- 
tioned models take into account for Rayleigh flat and selective fading, shadow fading, Line Of Sight (LOS) path loss, 
Doppler effect, etc. 

[0007] Trade-off between noisy and accuracy in the estimate has been emphasized in the European patent 

20 application EP 01 830291 .9 filed in the name of the same Applicant SIEMENS ICN, designating as inventors: Vincenzo 
DEL BELLO, Monica NICOLI, Umberto SPAGNOLINI. The content of the patent application has been subsequently 
disclosed in the article [REF. 2] of Monica NICOLI and Umberto SPAGNOUNI, titled "REDUCED- RANK CHANNEL 
ESTIMATION AND RANK ORDER SELECTION FOR CDMA SYSTEMS", Proc. IEEE International Conference on 
Communications (ICC'01), May 2001 which is incorporated by reference. The article of [REF. 2] states the following 

25 parsimony criterion that the channel estimation should be inspired to; "It doesn't need estimate more parameters than 
the strict necessary". A first particular implementation of this principle was the subject matter of the former invention, 
while a quite different implementation of the same principle is the subject matter of the invention disclosed in the present 
patent application. Basing on the parsimony criterion the accuracy to which a channel estimate can be performed using 
a known training sequence inversely depends (approximately) on the ratio between the number L unknown of parameters 

30 to be estimated and the length L trBjn}ng of the training sequence, being said parameters, for example, taps of temporal 
FIR filters. Recurring to training sequences very length it should be possible estimate the L unknown parameters near 
exactly, because the variance of the estimation decreases. Choosing to estimate a lower number of parameters 
(l-unknown < Unknown) tne estimate should be run into an error, also termed biasing (polarization), due to the under- 
parameterization of the channel. Nevertheless in this event the estimation of the remaining parameters should be made 

35 better, because the ratio L ' un knowr/^tmining is ,ower tnan L unknowr/ L traning- Tne * ull-dimensional estimation of the channel 
matrix, named H or H LS indifferently in the following, requires calculation of K x M x W parameters, whose meaning 
has been already introduced, while a reduced-dimensional estimation requires a lower number depending on the skill 
in the exploitation of the a-priori information about the space-time propagation features along the radio path. 
[0008] Fig.1 graphically shows the parsimony principle relatively to a hypothetical RF scenario. The abscissa repro- 

^0 duces a scale proportional to the parameters to be estimated (without limitation the rank of phannel matrix) and the 
ordinate the MSE relative error ||H-H|p/||H|| 2 representing the accuracy of the estimate, being H the estimated channel 
matrix. Considering the hypothetical RF scenario upon which the fig.1 is based, the minimum MSE is obtained in 
correspondence of a parsimonious estimation using 3/8 of the maximum number of parameters. The MSE on the upper 
curve is the sum of two terms: the distortion error, that is large for low parameterization and depends on the charac- 

45 teristics of the propagation channel (delay/angle spreads, number of diffusive clusters, etc.), and the noise that in- 
creases with parameterization and depends on the interference level, other than the increasing of the ratio L unknowr / 

^training' 

[0009] The parsimony criterion implementation in the design of a multi-user receiver for uplink TD-CDMA involves 
some practical implications that could at first-sight discourage the design of a reduced complexity receiver. In fact, the 

50 parsimonious channel estimation seems not so parsimonious like its wording indicates, that because a full-dimensional 
estimation of the channel matrix H LS (H Least Squares ) is a preliminary condition essential in any case to the successive 
reduction step. Nonetheless in spite of a slight increment in the computational complexity and the introduction of some 
grade of polarization, a considerable performance is pursued as far as concerns precision of the estimate and simpli- 
fication in channel modeling. For this aim both the articles of [REF. 1] and [REF. 2] underline that an imperfect channel 

55 estimation degrades the successive data detection, specially when digital receivers adopt Joint Detection (JD) to coun- 
terfeit Multiple Access Interference (MAI) intrinsic in CDMA systems. As known, MAI during estimation and/or detection 
is due to many causes, i.e. intercell interferences, selective multipath fading (Rayleigh), imperfect chip synchronization, 
imperfect choice of coding sequences, which together concur to drive both the midamble sequences and the CDMA 
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signatures (convolution between impulse responses of semi-burst data and orthogonal spreading sequences) no more 
orthogonal each other, so that each user is interfered from the other ones and in its turn acts like an interferer for the 
others. 

[0010] Because of the Parsimony principle is strictly connected to the channel modeling, the COST-259 modeling 
needs to be briefly discussed with reference to the Figures 2, 3, 4 and 5. The following four radio environments have 
been proposed: Generalized Typical Urban (GTU), Generalized Bad Urban (GBU), Generalized Rural Area (GRA), 
and Generalized Hill Terrain (GHT). GTU channel has high time dispersion and low angular dispersion. GBU channel 
has high both the dispersions. GRA channel has low both the dispersions. GHT channel has low the time dispersion 
and high the angular dispersion. 

[001 1] With reference to the Directional Channel Model depicted in fig.2 it can be argued that the channel response 
depends on N independent local scatters (clusters), each described by multiphats with associated amplitude, delay 
and azimuth. All the cluster parameters are independent random variables; the probability density functions are as- 
signed according to the outlined propagation environment. In addition to the line-of -sight cluster (i.e. the Main cluster) 
there are some other clusters (Secondary clusters), their numbers are Poisson distributed, although for the sake of 
simplicity a main and a Secondary clusters are only depicted in fig.2. Considering a directive array of antennae having 
a steering vector a(6), being e the incidence angle between a planar wave and the direction of an Uniform Linear Array 
(ULA), under the unlimiting hypothesis that only one mobile station is transmitting, the channel impulse response es- 
timable from a directive array is: 



where: c indicates the actual cluster; N c the number of clusters; A/ r the number of rays which the cluster reflects towards 
the BTS; 9 c / is a relevant azimuthal direction; x c j is the delay of ray r c b and g(t) is the transmission pulse filtered by 
the reception filter. 

[0012] Fig.3 shows a temporal power distribution P(t) of the channel depicted in fig.2, P(x) has exponential decreas- 
ing for each relevant cluster and Sx,1 , Sx,2 represent the time dispersion of main and secondary cluster respectively. 
[0013] Fig.4 shows an azimuthal power distribution P(9) relative to a cluster of the channel depicted in fig.2. P(0) is 
a Laplace distribution decreasing with 0. 

[0014] Fig.5 provides an alternative representation of the channel depicted in fig.2 relative to the GTU, GBU, GHT, 
and GRA channels. Depictions concern Power Delay Angle (PDA) diagrams having in abscissa the Direction Of Arrival 
(DOA) values in degrees and in ordinate the Time Of Arrival (TOA) values in us. The resolution is limited both temporally 
from the UMTS frequency band (near 4 MHz) and spatially from the number of the directive array elements (M = 8). 

OUTLINED TECHNICAL PROBLEM 

[0015] The problem of channel estimation with complexity reduction faced in the cited EP 01830291 .9 patent appli- 
cation was prevalently addressed to the radio environment scenario depicted in fig.2 in which the contribute of the 
various clusters seems more immediate than PDA diagrams of fig.5. In fact, the scenario of fig.2 suggests a limitation 
of the full-rank channel matrix to the only most significant eigenvectors representing in some ways "to be determined" 
the most significant clusters. For this aim the claim 1 of the aforementioned application recites approximately as in the 
following: 

"A Base Station receiver connected to a directional array of M antennas for receiving burst of data from K mobile 

stations of a cellular telephony system, the receiver having: means suitable for carrying out a full-dimensional 

multi-channel parameters estimation of a space-time channel model affected by AWGN noise and spatial interfer- 
ence, and means for multi-user detection in presence of noise and interference characterized in that said multi- 
channel parameters estimation means further includes: 

• reduction-complexity means for re-estimating said full-dimensional space-time channel model with a reduced 
set of re-estimated parameters in order to obtain one or more cascades, each cascade including a beamformer 
and a temporal FIR filter, being said cascades decoupled each other and their numbers depending on the 
specific user; 

• reduction-complexity control means arranged for limiting the re-estimation to a number r k of said cascades for 
each of the K users, r k being chosen dynamically as the number of independent cascades that minimizes the 
redundancy of the full-dimensional space-time channel model according to the information theory criterion 




Eq.1 



C=\ 1=1 
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(Minimum Description Length cited in the disclosure)". 

[001 6] Clearly each of said cascade takes into account for spatial and temporal dispersion of a single cluster of the 
ones the channel of fig.2 is composed with. The approximation with realistic propagation channel model COST-259 

5 for UTRA-TDD (UMTS Terrestrial Radio Access * TDD) holds r k values spanning half the whole range. Simulation 
results have proven that the adaptive RR estimate in comparison with Full-Rank estimate has a lower normalized MSE 
versus SNR/SIR (Signal to Noise Ratio / Signal to Interference Ratio) in the whole dB range. 
[0017] Although the performances of the adaptive Reduced Rank receiver are good, the inventors were convinced 
that further improvements could be achieved whether some other ways were explored to face the technical problem 

io of under-parameterizing the full-complexity channel model. That because the channel estimation means of the afore- 
mentioned RR receiver are complicated because of the particular eigenvectors decomposition of the Full-Rank channel 
matrix, and the MDL (Minimum Description Length) re-estimate for reducing the dimension to meet the effective spatial 
and temporal characteristics of the only relevant clusters. A desirable goal in the outlined technical field should be the 
search of news reduction-complexity means simpler than the preceding ones and equally adaptive to the change of 

is the radiofrequency environment. Fig.5 gives a hint in this direction clearly showing that the energy is variously spotted 
on the surface of the PDA diagrams and rapidly vanishes outside the spots, so that the majority of the surface is void. 
The spotted diagrams of fig.5 are consistent with the preceding Figures 2, 3, and 4 where it can be argued that a high 
temporal dispersion depends on the number of involved clusters, but as far as concerns the single cluster the time 
dispersion is limited, so that the various samples are space-time correlated. 

20 

PURPOSES OF THE INVENTION 

[0018] Purpose of the present invention is that to indicate an alternative solution to the problem of under-parame- 
terizing the Space-Time channel estimate. The proposed solution must be able to simplify the known under-parame- 

25 terization strategies, in particular those based on the rank reduction of the channel matrix. The proposed solution has 
to further reduce the MSE estimating error and has to be adaptive with the change of the radiofrequency environment. 
[0019] A welt appreciated requirement of the invention should be that to provide a simple criterion for selecting the 
only significant elements of the full-dimensional channel matrix and disregarding the ones considered excessively 
noisy. The criterion shall be usable with cellular systems of UMTS (CDMA) type and GSM (single-user-per-channel) 

30 type, in both cases exploiting set of antennas either directive arrays or not bind to a particular geometry. 

[0020] Other purpose of the invention is that of indicating a solution suitable for Rayleigh fading model, exponential 
fading model, or other suitable fading model in accordance with different power attenuation profiles. 
[0021] Other purpose of the invention is that of indicating solutions for any antenna spacing provided that the delays 
across the array is a fraction below the temporal resolution of the signal. The indicated solutions should be suitable 

35 either for uncorrelated or correlated fading, the last concerning closely spaced antennas (A/2 spaced apart) where the 
fading amplitude is correlated among the various antennas, i.e. in first approximation it differs by a phase shift (fixed 
delay) between two adjacent antennas depending on the direction of the considered path. 

[0022] Other purpose of the invention is that of indicating a solution able to exploit the slot-invariance of some char- 
acteristics of the space-time channel in time slotted CDMA/GSM systems. For completeness, each time slot corre- 
40 sponds to the length of the transmission burst, and said invariance involves successive frames. 

[0023] The aforementioned purposes are directed to the indication of a method for under-parameterizing the channel 
estimate in radiomobile Base Stations exploiting set of antennas, and to a receiver based on the indicated method. 

SUMMARY OF THE INVENTION 

45 

[0024] To achieve the purposes of the invention the subject of the present invention is a method for under-parame- 
terizing the channel estimate in radiomobile Base Stations exploiting set of antennas, as disclosed in the claim 1. 
[0025] Further subject of the invention is a radiomobile Base Station receiver including a channel estimator operating 
in conformity of the claimed method, as disclosed in the receiver claims. 

so [0026] The method of the invention criterion makes use of a classification algorithm for the selection of L < W useful 
samples. The basic assumption is that in a multipath channel with Rayleigh fading the elements of the channel matrix 
can be described by simple probability density functions (pdf s). For this aim the LS channel matrix shall be spatially 
and temporally pre-whitened to make spatially and temporally uncorrelated the estimation noise. The classification 
algorithm on the basis of the pre-whitened observed samples summed up to a noise term n h supposed Complex 

55 Normal (Gaussian), with 0 mean and variance o n 2 [n h ~~ CN{0,a n 2 )], is able to decide whether the sample either belongs 
to the useful or noise subset. The statistical process governing the two subset is described by Rayleigh probability 
density functions with different variances depending if the samples are estimation noise or useful noisy samples. In 
general, the samples \h,\ 2 or any linear combination is described by Gamma pdf. A threshold S is needed for discrim- 
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inating between the two subset. Threshold S is calculated in order to minimize the probability of erroneous classification. 
At the beginning of the classification algorithm ail the useful statistical parameters 



parameters of a Rayleigh mixture have been disclosed in the article [REF. 3] of Umberto Spagnolini (co-inventor) titled: 
"Cancellation of Polarized Impulsive Noise Using an Azimuth-Dependent Conditional Mean Estimator", published in 
IEEE Trans. Signal Processing, Vol.46, No 12, pp.3333-3344, Dec. 1998. Estimates are initialized by adopting a single 
user-defined starting point consisting, for the case of the invention, on the initial value of the only variable p U sefut Then 
said statistical parameters are iteratively recalculated until the dimension L of the reduced set remains unchanged. 
Only few iterations are needed to set the final value of the threshold S valid for each observed sample \h t \. Although 
the teaching of this article is useful, some conceptual difficulties arise to the one skilled in radiomobile field in the 
discover of this article because it is directed to quite a different technical sector involving magnetotelluric methods for 
subsurface investigation, i.e for earthquake prediction, underground prospecting, etc. As known, these methods are 
based on measurements of ground electromagnetic fields characterized by low frequencies. The measured signal is 
only noise of different type, such as the natural noise (< 5 Hz) uniformly distributed on the azimuth, and the man-made 
electromagnetic interferences (EMI; < 1 kHz) which tend to be polarized in fixed azimuthal directions coinciding for 
example with power line, electrified railways, and grounding devices. The teaching of the article is focused on the only 
signal detection for separating the non-Gaussian impulsive noise from the Gaussian natural noise, in order to cancel 
the first. Channel estimation is not mentioned in this article and suggestions in the direction of generating a particular 
mask for masking the channel matrix cannot be inferred consequently. From the differences pointed out it can be 
appreciated that a radiomobile channel affected by microwave multipath fading and intercell interferences represents 
quite a different scenario with respect to magnetotelluric, although both of them give rise to the need of classifying the 
observed samples and estimate the involved statistical parameters. 

[0028] Two classification algorithms (A and B) are disclosed; the algorithms differ each other by a different polari- 
zation of the estimate of said statistical parameters. The ratio y= o 2 useful fo 2 noise ls an useful indicator for tracing the 
polarization entity of the various parameters involved in the estimation. 

[0029] The Rayleigh fading model can be modified in order to take into account different power delay profiles. In 
accordance with the selected profile t the statistical process governing the two subset of classified samples is described 
from probability density functions of the same type with different variance. The selection criterion needs to be modified 
accordingly, but substantially the same classification algorithms are implemented being the most relevant modifications 
charged to the expression of the threshold S which is directly derived from the pdfs. Any suitable pdf can be used in 
as many variants of the invention still employing a mixture model of the samples of the whitened channel estimates. 
Depending on the antenna spacing, central and non-central x-squared can be used; mixture of Gamma pdf is a sub- 
optimum solution as a generalization of threshold selected by Rayleigh mixture. 

[0030] Fig.6 shows a simplified representation of the channel matrix H LS valid for the tf h user, where each line vectors 
r lsi» n LS2» h LSm» "lsm include W estimated samples of the impulse response received on the rri h antenna in 
correspondence of each midamble chip transmitted from the user. The opportunity of considering CDMA user each 
other independent descends from having supposed orthogonal the effective midamble sequences. Line vectors h LSm 
of the matrix are like FIR filters representing the channel at the corresponding rri n receiver. From the observation 
of the PDA diagrams of fig. 5 and deducing a resemblance with the channel matrix H LS of fig.6 (in all effect each PDA 
representation is comparable to a battery of FIRs placed side by side as the considered DOA stripes), the Inventors 
thought that, under particular hypotheses, a simpler way to reduce the Space-Time complexity of the H LS matrix should 
be a band-mask able to change H LS into a H STB matrix like that of the tf h user depicted in Fig.7. So doing the only 
zones having relevant energy are really considered, while a great area is disregarded. This assumption is independent 
from the particular geometrical displacement of the antennas equipped in the BS, the only condition to be verified is 
that the difference between the TOA of corresponding echoes evaluated for each couple of antennas be lower than 
temporal resolution of the channel estimate. A strict condition is for TDD-UTRA system with f 0 = 2 GHz and B = 3,84 
MHz, where the temporal resolution is only 260 ns. The aforementioned condition is surely met by closely spaced 
antennas, for example an array of M antennas X/2 spaced apart (for TDD-UTRA XI2 is near 7,5 cm), receiving signals 



(i.e.! S, PuseM, noise* useful) 



are unknown, so they are to be estimated. 

[0027] Some useful hints in the direction of iteratively estimating unknown 
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whose band is much lower than the carrier, as in the known cellular systems. In such a case the fading amplitude is 
correlated and the echoes impinge the M antennas with a delay only depending on the distance between two adjacent 
antennas. As the distance between the antennas departs from A/2, still leaving the same delays to the array, the fading 
amplitude correlation diminishes; nevertheless it is possible neutralize these effects by means of a space merging 
5 parameters which decreases as the maximum distance between adjacent antennas increases. This parameter is the 
percentage of the samples classified as useful at the same time instant on all the M antennas. Above a fixed percentage 
at) the remaining column elements of the H LS channel matrix are taken as useful too, under this percentage even the 
useful elements are considered as noise. A percentage of 30% has been tested to be suitable for all practical space 
diversity systems spanning from 10 to 15 X the maximum distance between two adjacent antennas. Fig.8 shows the 
w energy distribution inside the Band Masked channels GTU, GBU, 6HT and GRA, from which it can be appreciates 
that the mask approximates the effective displacement of the energy among the degrees of freedom of the channel. 
[0031] The under-parameterization method of above builds the matrix H STB of Fig.7 in two steps starting from the 
H LS matrix. During the first step the classification algorithm selects L < W useful elements of each h LSm vector using 
the same statistical criterion for all the M antennas. At the completion of the first step the H LS matrix results space- 
is time masked with a space-time mask matrix M ST (for the user) depicted in f ig.7a. The second step of the method 
performs a space-time equalization of the M ST mask matrix which generates a band mask matrix M STB like that visible 
in ffg.7b. Band mask M STB is usable for obtaining the band-masked channel matrix H STB of fig.7. Looking at the H STB 
matrix it can be argued that the final selected/unselected elements of the H LS channel matrix are those referred to the 
same time instant on all the M antennas, so that the method for obtaining it can be named Temporal Masking.The 
20 introduced masking concepts appear more immediate in the case of a variant of the invention suitable for closely 
spaced antennas where the fading amplitude is correlated among the various antennas. Fig.9 shows the time going 
of the square module of the channel estimate at the various antennas 



for the k* h user. Figures 7a and 7b, both aligned below the fig.9, show the evolution of the masking procedure towards 

30 the exploitation of all the possible spatial and temporal correlations of the channel. 

[0032] In a variant of the invention suitable for time slotted CDMA/GSM systems a multi-slot classification is per- 
formed. The opportunity of this variant is due to the fact that while the faded amplitudes are changing the temporal 
locations of the mask are not changing slot-by-slot. In such a variant, firstly, the spatial pre-whitening of the channel 
matrix H LS is performed using a spatial filter averaged on a certain number NS of sequential time slots; secondly, all 

35 the useful statistical parameters (S, p use f^ a 2 noise* ° 2 useful) are estimated averaging on an extended set of samples 
belonging to the same NS number of time slots. Apart from an initial delay of NS x T burst , spatial pre-whitening and 
classification operations concerning the actual time slot are performed by simply updating with the new H LS matrix 
samples both the multi-slot spatial filter and the estimation of statistical parameters. 

[0033] A further under-parameterization attempt has been carried out executing the above classification on the ele- 
*o ments of a channel matrix whose rank has been already reduced by using channel estimators of the prior art, for 
example the adaptive Reduced Rank channel estimator of the preceding invention [see REF. 2] of the same SIEMENS 
ICN assignee. In such a case the reduced rank channel matrix had been termed H RRk = A k B k H , where: k indicates 
the k m user; (..) H is the transposed conjugated or Hermitian operator; A indicates a matrix including r beamformers, 
and B a matrix including r temporal FIR filters. This should increase the previously defined parameter y because the 
4 5 noise power has been preliminarily reduced and even a spatial complexity reduction already accomplished. In such a 
variant the temporal and spatial pre-whitening of the H LS matrix had already been carried out in the aforementioned 
adaptive-RR disclosure [REF. 2], so that further whitening are unneeded. It's useful nevertheless point out that pre- 
whitening in the adaptive-RR invention had a quite different purpose in respect of the pre-whitening adopted in the 
present temporal masking, so that the skilled man faced to temporal masking are not prompted from the preceding 
so pre-whitening. More precisely, while adaptive-RR estimation performs the whitening filtering for making each other 
orthogonal the whitened beamformers 



25 




55 




and the whitened temporal FIR filters 
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the Temporal-Masking instead performs the whitening filtering to make uncorrelated the estimation noise so that the 
elements of the channel matrix H LS can be described by simple probability density functions (pdf's). 
[0034] Consequently still subject of the invention is a method for under-parameterizing the channel estimate which 
combines the technical features of the Rank Reduction of the prior art with those of the Temporal Masking method of 
the claim 1, as disclosed in a further independent method claim. 

ADVANTAGES OF THE INVENTION 

[0035] From the mask matrixes M ST and M STB visible in fig.7a and 7b and the relative masked channels, it can be 
appreciated the presence of a large quantity of zeroed samples, so that their noise contributions disappear in the 
channel estimate and the confidence of the remaining estimated samples increases. The detector that uses those 
relevant channel matrixes is simpler consequently. Besides, according to the statistically determined threshold S, that 
is close to the optimum threshold as far as the MSE concerns, the masked area is automatically adapted to the variable 
condition of the channel. In that the proposed method differs from the routinely employed methods making use of a 
fixed threshold with respect to the channel energy, in order to select the most significant taps only of the channel FIR. 
Constant energy threshold is reductive because, as it will be proved in the following (fig.32), the energy fraction \\ that 
minimizes the MSE error of the reduced-dimensional model depends on the statistical distribution of the energy among 
the channel samples. So that only taking into due account the aforementioned statistical parameters the energy fraction 
H can be related to the optimum threshold. 

[0036] Among background art method the following article [REF. 4] is worth to be mentioned: "Channel Acquisition 
for Wideband CDMA Signals", authors Vinayak Tripathi et at., IEEE JOURNAL OF SELECTED AREA IN COMMUNI- 
CATION, Vol.18, No. 8, August 2000. The main problem faced in the article is to determine some uplink channel 
estimation algorithms for a single new user entering a wideband CDMA cellular system. In a second stage the problem 
of reducing the complexity of the preceding ML estimated channel model is considered. The focus of the proposed ML 
algorithms is that to estimate correct channel parameters in presence of multipath fading and multi-access interference 
(MAI), so that joint acquisition scheme is mandatory for estimating the overall interference. Not any hint is done to the 
opportunity of exploiting set of M antennas. In that the proposed complexity reduction algorithms are forced to be only- 
temporal, to say, obviously silent on features for reducing the space parameters. Notwithstanding that and willing to 
consider the only-temporal features, the disclosed optimal approach to reducing the number of taps of FIRs simulating 
the K channels is teased on maximizing a correlation performance metric (h c , /i r ) between the ML estimated channel 
impulse response h c and the unknown reduced set response h r (r< c). The proposed tap-reduction algorithm is com- 
putationally more complex and different from the Temporal Masking of the invention in subject (which is not based on 
the maximization of said correlation and designed for a set of antennas). Besides the same authors recognize that 
their algorithm is not optimal hence propose other simpler ad hoc alternative approaches. The topic of these approaches 
(Max-L r approach) is to simply pick the L r most significant taps of the L c taps in the ML estimated channel impulse 
response As far as it results from the Section titled "Numerical Results" referred to the various simulations, the 
reduced number of taps L r is taken arbitrarily without any considerations on the pdfs of the relative amplitude and the 
probability of surpassing a quasi-optimal energy threshold statistically determined. In that the method of Tripathi et al. 
should be included into the aforementioned routinely employed ones without interfering with the present invention. 
[0037] At this point a comparison between the two under-parameterization strategies, namely the Temporal Masking 
of the present invention and the preceding Adaptive Reduced Rank, is needed in order to confirm the initial intuitions 
of the inventors concerning the computationally efficiency of the TM method. A further comparison of said strategies 
with the combined RR-TM and the LS estimator is also needed. Relevant terms of comparison are the statistical dis- 
tributions of the number of parameters effectively used in the estimation of the GTU, GBU, GHT and GRA channels 
of fig.5. Other relevant terms of comparison are the graphics of the MSE estimation error versus SNR. A common 
estimating quality threshold Q_TRH is needed to halt the considered reduction-complexity processes immediately 
before the threshold is crossed down. The reciprocal of the expression indicated in fig.1 for the MSE error is taken as 
Q_TRH, and a 20 dB value seems as a good choice. The various under-parameterization strategies are faced to a 
statistical relevant number of independent samples for COST-259 channel simulations and the results are collected 
for obtaining the distribution functions visible in Figures 10 and 11. Flg.10 shows a family of curves representing the 
rank distributions for GTU, GBU, GHT and GRA channels in case the receiver uses an 8-elements array for receiving 
in the UMTS band. Flg.11 shows a family of curves representing distributions of the useful channel chips L for the 
same channels of the preceding fig.10. Unfortunately Figures 10 and 11 are not each other immediately comparable. 
[0038] Figures 12 and 13 reproduce the curves of Figures 10 and 1 1 using a common independent variable on the 
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abscissa referred to the Degrees of Freedom (DF). From the comparison it can be noticed that the Temporal Masking 
offers better opportunities than adaptive-RR to estimate the same channel model with a reduced set of parameters. 
The following comparative table reports the obtained DF values for a 90% probability P. 





GRA 


GTU 


GHT 


GBU 


Adaptive-RR 


126 


186 


244 


300 


Temporal Mask 


48 


104 


128 


200 



[0039] In comparison with the full-complexity LS channel model of the conventional approach, where the estimated 
Degrees of Freedom relative to the single user is MW, the two reduced-complexity models have the following DF: 

r k (M+W)- r k 2 , usually « MW 

for the adaptive- RR model; 

LM<r k (M+W)- r? 

for the Temporal Masking model. 

[0040] Figures 14 to 17 are referred to respective COST-259 channel types in a simulation context like the one of 
Fig. 1 0 and 1 1 , with the only classifier A. Each Figure shows four normalized-MSE comparative diagrams of the channel 
estimate versus SNR, as in the framed Legenda. The reduction-complexity techniques allow an improvement on the 
estimate valued from 6 to 12 dB in terms of SNR. The Temporal Masking alone outperforms the preceding adaptive 
RR for all the considered channels. Figures 14 to 17 in conjunction with fig. 12 and 13 further allow the following con- 
clusions: 

• GRA channel thanks to its intrinsic lower complexity is that which more benefits from the use of the proposed 
under-parameterization methods. 

• GBU channel is highly complex, this limits the skill of the various estimating algorithms in the MSE reduction. In 
fact, for high SNR all the estimations converge towards the full-complexity solution. 

• For low SNR the Temporal Masking classifier get worse than for higher SNR (but still better than the adaptive RR). 

• The value of the parameter y improves if a preliminary rank reduction is performed. This is shown in the Figures 
where the MSE of the RR+TM estimate doesn't show the sharp worsen for the lowest SNR. 

• The combined action of the preceding adaptive RR and the present TM estimate has not an improvement (in 
respect of the ML estimate) equal to the summation of the singular gains. This highlight the fact that the RR estimate 
has already partially operated on the time dimension. 

• The incremental improvement of the RR+TM estimate is limited in respect of the TM only. So the relative advantages 
are presently not apparently enough to justify the additional computational cost, especially needed in channel 
spatially complex like the GHT and GBU ones. This conclusion should be reversed in the future thanks to the 
continuous technological progress of the devices. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0041 ] Further objects and advantages of the present invention will be made clear by the following detailed description 
of an embodiment thereof and the annexed drawings given for purely non-limiting explanatory purposes and wherein: 

fig.1 shows a graphical representation of a principle which inspires the method for under-parameterizing the chan- 
nel estimate of the present invention; 

Figures 2 to 4 show useful representation of the channel model COST-259; 
fig.5 shows PDA diagrams of the channel model COST-259; 

fig.6 shows a single-user representation of the M-antennas channel matrix H LS (Least Square); 
fig.7 shows a matrix H STB obtained from band-masking the channel matrix H LS of fig.6; 
f ig.7a shows a Space-Time mask matrix M ST ; 
f ig.7b shows a Space-Time mask matrix M STB ; 

fig.8 shows the band masked channels relative to the PDA diagrams of fig.5; 
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f ig.9 shows Af time diagrams of as many impulse responses estimated from the signals received on the A/f-antennas 
of fig.6; 

fig. 10 depicts a family of curves representing as many rank distribution functions of the channel matrix obtained 
with an adaptive reduced-rank method of the prior art; 

fig.11 depicts a family of curves representing as many distribution functions referred to the method of the present 
invention; 

Figures 12 and 1 3 depict the preceding families of curves referred to a common independent variable on abscissa; 
Figures 14, 15, 16 and 17 show for each COST-259 channel type some normal ized-MSE diagrams versus SNR 
comparatively referred to various channel estimators; 

fig. 18 shows a typical transmission burst of a TD/TDMA-CDMA UMTS radiomobile system; 

fig.19 shows a channel model representation for single user and single antenna; 

f ig.20 shows a channel model representation for K users and single antenna; 

fig.21 shows a channel vector for K users and single antenna; 

fig.22 shows a midamble channel matrix for K users and single antenna; 

Figures 23 and 24 show some matrix expressions for obtaining the received signal and the channel matrix for K 
users and M antennas; 

Figures 25 and 26 depict probability density functions (pdf) of selected and unselected estimated samples effected 
by a classifier used in the method of the present invention; 

Figures 27, 28, 29 and 30 show comparative diagrams of as many parameters estimated by two different classifiers 
available for the method of the present invention; 

Figures 31 and 32 show diagrams concerning the optimality of a classification threshold; 

Figures 33 and 34 show diagrams of comparative estimation performance in presence of spatially uncorrelated 

and correlated noise respectively; 

Figures 35a, 35b, 36a and 36c show different statistical set suitable for classification in as many multislot variants 
of the invention; 

f ig.37 shows a discrete time model of a CDMA system from the transmitted data to the received sequences; 
Figures 38a, 38b, 38c and 38d show the structure of some matrixes suitable for Data Detection comparatively 
referred to the prior art and to the method of the present invention, respectively; 

f ig.39 shows the structure of a radiomobile Base Station Receiver operating in conformity with the method of the 
present invention; 

Figures 40 and 41 show performance diagrams of a detector based on a channel model obtained by the method 
of the invention in presence of spatially uncorrelated and correlated noise respectively; 



DETAILED DESCRIPTION OF A FIRST EMBODIMENTS OF THE INVENTION 



[0042] With reference to fig.18 an uplink CDMA transmission burst is depicted in which a Training sequence in 
midamble position is visible between two Data blocks 1 and 2, the last being followed by a Guard Period GP to avoid 
interference with the successive bursts. The discrete time model is obtained by sampling at the chip rate 1/T C the 
signals received from the antennas after the chip matched filter. Within the same cell, in the same frequency band and 
in the same time slot, K users are assumed to be simultaneously active. Each mobile station (MS) is equipped with a 
single transmitter antenna, while at the base station (BS/BTS) an antenna set of M elements is employed. The users 
transmit in bursts each consisting of the two data blocks and the user specific training sequence (midamble) for channel 
estimation. Each data block contains N s modulated symbols of duration T s = QT C spread by a user-specific signature 
c k of Q chips for the tf h user. The midamble consists of N m = N+W-1 chips, W is the dispersive length of the channel. 
For example, in a first case referred to the specification concerning the TDD-UTRA, N=Kx W= 456 chips (K= 8 and 
W= 57) while in a second case referred to a CWTS standard, A/= 128 chips (K = 8 and W= 16). At the BTS side the 
received signals are separated into two subset of samples, the first one depending on the training sequences in mi- 
damble position and the second one depending on the two data blocks. The conventional approach for the estimation 
of the S-T channels from the samples of midamble is based on the Least Square (LS) estimate of the coefficients of 
the FIR filters summarizing the channel model. Then a S-T multi-user detection is carried out on the two data blocks 
received before and after the midamble, in order to estimate the 2/VcKchips belonging to the two data blocks. Detection 
needs the preceding knowledge of the channel estimates and the spreading code sequences for fc=1,...,K. The 
multi-user detection for space-time channels is known, for example, from the article of X. Wang, H. V. Poor, "Space- 
time multi-user detection in multiple CDMA channels", published in the review IEEE Trans. Signal Processing, Vol. 47, 
pp. 2356-2374, September 1 999 [REF. 5]. This type of detection is useful to contrast with the Multiple Access Interfer- 
ence (MAI). During the channel estimate correlative sequences {s k } between the midamble of the k m user and the 
relevant channel samples are effected. Due to Inter-Symbol Interference (ISI) introduced by the channel the first W-1 
chips of each correlative sequence {s k } are interfered from the signature tail of Data block 1 , while the last W-1 chips 
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of the same sequence {s k } superimpose as many first chips of the signature concerning Data block 2. The way to 
cancel this type of interference before channel estimation and data detection is well known in the art and the argument 
is not further considered hereinafter. 

[0043] Figures 1 9 to 24 show various model for channel estimation and the involved vectors and matrices at different 
level of complexity. Fig.19 shows a simplified channel model for the single user and the single antenna. With reference 
to the figure: J k ' m) represents the rt h midamble chip of the kP 1 user received on the mt h antenna; h< k * m > is the discrete 
time channel impulse response of length Wfor the link between the tf h user and the m f/J antenna; r/ m *is a noise 
assumed to be Gaussian in the amplitude, with zero mean, temporally uncorrected but spatially correlated; finally 
y(fr.m) re p resen ts the signal received by the mP 1 antenna. For the sake of simplicity h( k > m ) is a FIR filter whose coefficients 
include the convolution with the transmission and the reception filters having raised cosine frequency responses with 
22% roll-off for the standard considered here. 

[0044] The model of fig. 1 9 is the simplest one and can be used to built the successive more structured models, such 
as that of fig.20 referred to K users simultaneously transmitting. In such a case the contribute of the various channel 
are summed up at the receiving antenna. Fig.21 shows the resulting channel vector h< K ' m > for the K users on the nV A 
antenna, and fig.22 shows the structure of the relative midamble matrix X for the K users obtained by arranging the 
N mi(J midamble chips into this matrix KW x N mi(J dimensioned. Figures 23 and 24 show the structure of the various 
matrices obtained by the generalization of the model of fig.19 to the case of M antennas and K users simultaneously 
active on the same cell. 

[0045] The signal received at the time T c by the set of M antennas is: 

y^=SH (i) X W + n„ Eq.2 

(here 1 < k < Kand 1 < m < M). Matrix 

(where (...) T is the transposition operator) represents the S-T channel matrix for the k* h user; it consists of M vectors 
each representing the discrete time channel impulse response h(^ m ) of length W visible in the preceding Figures; x*is 
a vector generated from the k* h midamble; n n is the noise still assumed to be Gaussian, with zero mean, temporally 
uncorrelated but spatially correlated. Under these hypotheses the noise expectation value is 

where R ns is the space-covariance matrix normalized such that 

that takes into account for the multi-access interference, and 5 is the Kronecker delta (8 k = 1 if k = 0, 5 k = 0 for every 
k * 0). Recall that M > 1 ; being M = 1 a special case of S-T channel when the spatial diversity degenerates; this case 
is not extensively discussed but it can be derived from the general discussion herein. As previously said the estimation 
of HW is based on the transmission of K different training sequences 

known to the receiver. Let N mid = N, by arranging A/time samples of the matrix 
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the multi-user model for channel estimation can be written using the standard notation: 



Y = jH ( * ) X ( * ) +N = HX-hN, Eq.3 

Jt=i 

where 



xw=[ x w-,ij)] 



denotes the W x N convolution Toeplitz matrix for the midamble of the k* h user, 



H = [H (,) ,-,H< Ar) ] 

and 



x=[x (1)r ,-,x ( ^f 

are multi-user matrices respectively obtained by concatenating the K channels and the K training sequences. Matrix 

N = [n,,-,n„] 

models both the overall additive ambient noise and the inter-cell interference. 

[0046] The Least Squares estimates of the S-T channel matrix H and the consequent estimate of the space covar- 
iance matrix R ns respectively are: 

A A A 

H=R yx Ri, Eq.4 

A A A A A A A 

R ns = R yy • R >* R xx R yy * H R xy Eo .- 5 

where: 

A 

• H yx = YX H /A/ is the overall (intended as multi-user and multi-channel) cross-correlation between Y and X matrices, 
fteing N the considered chips; 

• = XX H /N is the overall autocorrelation matrix of the midambles; 

• Ryy = YY H /A/ is the overall autocorrelation matrix of the samples of the received signals. 

A 

[0047] In Eq. 5 the space covariance matrix R ns is estimated from the residual q{ the channel estimation. 
[0048] According to the A block structure of Jfie matrix H the channel estimate HW for the tf h user is obtained by 
selecting the tf h block of H. The estimate of H can be advantageously performed in known way, even different from 
the direct solution of Eq. 4, for example by means of the Fast Fourier Transform (FFT) and its Inverse. 
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A 



[0049] Let 5H = H - H be the error of the LS estimate, the covariance matrix of the estimation error R H can be evaluated 
by stacking the columns of the matrix 5H into a vector Sh = yec{5H}; from the article of [REF. 2] it follows: 



where ® is the Kronecker matrix product whose properties are disclosed by A. Graham, in the yplume titled: "Kronecker 
products and matrix calculus", edited by Wiley, 1981 . Eq. 6 recalls that asymptotically (N -4 ~) R ns -> R ns . For (spatially) 
uncorrelated noise R ns = o 2 l M training sequences with 




and ideal correlation matrix R xx = \ KW , being \ M and l^two identity matrices. The MSE error of the LS estimate is 



where o 2 0 = a 2 /N. For each channel the MSE error depends on the ratio between the total number of unknowns in the 
channel estimates MW and the training sequences length N. As N is limited and fixed, the MSE error can be reduced 
by reducing the unknowns to be estimated only to those "really necessary". 

[0050] The reduction of complexity gives rise to a minimum problem. Let h(8) be the re-parameterization of the 
channel matrix for the user in term of a vector of new parameters 6, the re-estimation is carried out on the LS estimate 



R H =£[«5H5H w ]=R- I i®R„ 



Eq.6 




or equivalent^ the estimation error 



[5Rl iW ~ N (0,<r o 2 ) 



h (i) =vec{H (i) } 



as the non-linear regression: 



§ = argmin 




Eq,7 



where R w is the block of R, 



H 



H 



(i.e.R ( * ) = R£ H ®R n ,). 



Optimization of the Eq. 7 implies that the re-estimation has to be carried out after temporal 
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and spatial 

pre-whitening of the LS estimate. 

[0051] The method of the invention in subject reduces the complexity of the S-T channel estimate by setting to zero 
those values of the LS estimate that are below a specific amplitude threshold S obtained from the statistical analysis 
10 of the pre-whitened LS estimate. The re-parameterized channel for the user becomes h(6) = J(*)h<*) where the 
"mask" is obtained by a diagonal matrix JW, entries of JM are 0 or 1 depending on the value of the samples of the 
channel estimate. The mask generates following estimation error. 

h W _ jMfiW = (i^ _ jW)h« - J(*><5h<*> Eq. 8 

whose square value is: 

MSE" = tr ft k W )H (J (i) - 1 m )} + /r {r£>J<*>} , Eq. 9 

v — ■ v ' v " v "* 

MSE H MSE M 

25 where tr is the trace operator which sums up the elements of the main diagonal of the matrix inside braces. The two 
contributions in Eq. 9 are the MSE obtained by masking the channel samples (MSE H ) and by unmasking the noisy LS 
estimates (MSE 5H ). The trade-off is obtained by determining the mask J(*) that minimizes the Eq. 9, since over-masking 
increases MSE H and under-masking increases MSE 6H . Recall that the following bounds holds true: 



20 



30 



50 



MSE H <\h (k) f and MSE m <tr^Y 



[0052] The starting point to^et the mask JW for the tf" user is the spatial-temporal pre-whitening of the LS estimated 
35 Space-Time channel matrix H. Let 



z <*> = *H/2* W *£>H/2 Eq10 



40 be the M x W block for the user after spatial-temporal pre-whitening and let L the effective samples of the channel 
be smaller thaR MW. The ZW matrix corresponds to the Z LS one visible in fig.6. Advantageously both the spatial K m 
and temporal rM h/2 filters can be replaced with the respective Cholesky factor, well known in the art. For the sake of 
simplicity in thelollowing the midambles are considered quasi-ideal so that R xx = 1^ and the temporal pre-whitening 
is unnecessary, the previous Eq. 1 0 becomes: 

45 

ZW-R-^AW Eq.10' 



Each complex valued sample 



55 (see fig.6) is the sum of a channel sample h and a noise term 6h supposed complex Gaussian with variance (also 
indicated in the following): 
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z mM =h + Sh. 4*»f J =cr o. N ~ CN (0,<7 2 0 ). 

so that z mw is a random variable. As concerns the chip-spaced samples h they are identically Rayleigh distributed 
(under a Rayleigh fading) with variance 

E\h\ 2 \=*l 

[0053] Let Vbe the set of the L held useful samples, the temporal complexity reduction hypothesis can be so sche- 
matized: 



z mw = 0 Vm,w£V Eq. 11 



z m, w = z m, w Vm.wEV Eq.11a 

and z m w belongs either the one or the other the two disjoint classes, or hypotheses: 

• hypothesis H 0 m,w$V hence z mw = 5H 

• hypothesis H, m,weV hence z mw =h + 8H 

[0054] The classification method (or classifier) that will be described distinguishes between these two hypotheses 
on the basis of the observed z m w . In order to simplifying the notation, let 




and p the probability that z belongs the hypothesis H v The unconditional pdf of the random variable z is given by the 
mixture model: 



f{z)=pf{z\H x )+{\-p)f(z\H a ) Eq.12 

In case the hypothesis H 0 is effective, z = and because \6h\ is supposed CN t z has a Rayleigh (R) distribution 
whose pdf(z) is: 



f(z\H 0 ) = R (z\H 0 ) = e lal for z > 0 Eq. 13 

In case the hypothesis H, is effective for Rayleigh fading, z = |h+8h| has still a Rayleigh distribution whose pdf(z) is: 



f{z\H , ) = R (z\H , ) = e' 2cl for z > 0 Eq. 14 

o*, 

where = + aj\ In case of Rayleigh fading Eq.12 is: 
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f(z)= pR (z\H ,)+ (1 - P )R (z\H 0 )= Eq. 15 



= e~ 2a < + (1 - i?>4 e' 2al . Eq. 16 



also known as Rayleigh mixture. Fig.25 shows two possible curves 

R{z\H Q ) 



and 



[0055] In accordance with the criterion of Eq. 11 the binary hypothesis test based on the Bayes criterion disclosed 
in the volume "DETECTION OF SIGNAL IN NOISE", author A. D. Whale, Edited by Academic Press, 1971 , [REF. 6] 
can be used in the selection. Bayes criterion yields to the following Likelihood Ratio Test (LRT): 



H i 



that discriminates each of the sample z either belongs to the noisy channel estimate (HJ or noise (H 0 ). The LRT test 
can be reduced to the following amplitude threshold test: 

z > S(cr 2 ,a 2 ,p), Eq.18 

the over-crossing of threshold S classifies the sample zto the set Vof the L useful samples, while the sample z< S is 
zeroed. The classification algorithm used in the present invention calculates the threshold S by minimizing the proba- 
bility of uncorrect classification: 

Pa =P£M" .)+(*- P)%A Z \ H o) E <- 19 

The threshold value z = S that minimizes the Eq. 19 is obtained from: 

5* 
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^-pf{s\H l )-(l-p)/(s\H 0 )=0 



Eq.20 



for the Rayleigh mixture the preceding Eq. 20 becomes: 

10 2 pSe lai - 2(1 - p)Se 2 * 1 = 0 

which gives for the threshold S the expression: 



15 



( 1 l) 


In 


<7, 1-/7 






V°0 °\ ) 




l°"o P J 



Mi 



20 which can be easily implemented once the pdf parameters 



25 



30 



Eq. 21 



Eq. 22 



are known. Fig.26 shows a threshold S= 2,44 for the Rayleigh mixture f(z) whose pdf components are those of fig.25, 
supposing the following values 



(cr 0 2 =l; cr 2 =6; p = l5%) 



known. 

[0056] Two classification algorithms, respectively named A and B, are proposed for classifying the samples of the 
35 set {z} as belonging to the two mutually exclusive hypotheses H , or H 0 . Both algorithms estimate iteratively the pa- 
rameters 



45 



of Eq. 22 and use them to calculate the classification threshold S, starting from an initial arbitrary probability p used 
for roughly classify the samples of set {z}. The two algorithms are distinguished by a different polarization of the esti- 
mate. Classification algorithm A includes the following steps indicated by an iteration index f. 

a) sort by decreasing values the set 



50 



55 



taw}. 



(v^1,...,l/V;n l :r1,...,M) 

b) assign an arbitrary probability p = p^; 

c) calculate the dimension L = p jn MWoi the useful set that for now includes the first L elements of the sorted set; 

d) estimate: 
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5 



15 



35 



55 



<*. 2= 7 Z*l»! Eq.23 



e) estimate 



al= J^T)J^-' Eq - 24 

f) estimate a threshold 

at the iteration (/) from Eq. 22; 

g) classify the MW samples of the sorted set 

20 

{ Z mJ 

25 by enforcing the criterion: 

Zm.m > S m, w G Vz s ™ = z mw Eq. 25 

* S =» m.* e V => z*™ = 0; Eq. 25' 

where z is the unmasked/masked sample. The criterion of above is equivalent to the following one operating 
on the itf^j mask matrix visible in fig. 7a: 



m,we V^M^w)^! 

40 m . w ^ V => M ST(m,w) = 0 

h) calculate as the new dimension of the set V; 

i) compare L( /+1 > with the preceding LW and stopping the classification if L(* 1 ) - otherwise estimate: 

p<' +1 » = i.(' +1 >/MIV Eq.28 

and repeat the previous steps from d) to iXcyclicaJly until the stopping condition is met. An equivalent way to stop 
the iterations is by checking the condition S( ,+1 > = SM at the step 0- Trivial condition needs to be checked separately 
50 such as L > 1 . 

[0057] The experimented number of iterations is less than ten. The last value of L determines the first L elements of 
the sorted set 
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to be considered as useful at the end of cjajssificatiftn and hence the content of the M ST mask matrix. 
[0058] The estimated pdf parameters p, a 2 , a 2 , S are biased. As far as the polarization concerns^ the classifier A 
tends to over-polarize the variance a 2 performed with Eq. 24. In fact it can be proved that for ratio y = a t /a 0 sufficiently 
large said estimate can be approximated with: 

S^S 2 + a 2 Eq. 27 

where a 2 is the true variance and S is the optimal threshold. Now if L is the number of samples of the set 



classified as on the basis of S, the ratio UMW should be a polarized estimation of the probability p, 

[0059] The classification algorithm B, first variant of the invention, differs from the algorithm A in that it tries to remove 

the outlined polarization from a 2 and p by identifying and subtracting the contribution over/under threshold not counted 



in both the expressions Eq. 24 and Eq. 26. A 

[0060] In the case of classifier B the variance a 2 is now estimated with the following expression: 

Eq. 28 



Li m,w c- V 



and the probability p becomes: 



UMW- e 



2c: 



p= _^_ Eq29 

the remaining expressions and the various steps are unchanged. A A A 

[0061] Figures 27 to 30 show as many comparative simulations of the estimated set p,o 2 f c?,S by the two classi- 
fication algorithms, and an ideal reference, in function of the parameter y. Fig. 27 clearly snows the tendency of the 
classifier A to over-estimating the parameter a 2 , while classifier B tends to under-estimating it, being however able to 
keep the polarization low for the lowest values of the parameter y. The most relevant quality indicator of ^cjassjfication 
method is the polarization of the estimated threshold S; that because even if the estimated parameters p, a 2 , a 2 were 
erroneous but they would bring towards the ideal threshold S, the classification would be correct in any case. Fig. 30 
shows that the optimum threshold is achieved by classifier B for values of parameter y lower than those allowed from 
the classifier A. In spite of that a comparison of the MSE of the estimation versus SNR between the two classifiers 
gives the classifier A better than B for all the COST-259 channels. 

[0062] Either selecting the classification algorithm A or B at the end of the classification the space-time mask matrix 
M ST of fig.7a is obtained. The mask matrix M ST is the result of a classification that doesn't take care of the suggestions 
of the PDA diagrams of fig.5. For taking said diagrams into account and obtaining, firstly, the band mask M STB of fig. 
7b and then the final matrix H STB of fig. 7 the following empirical steps have been sequentially introduced: 

j) Spatial merging: if a predetermined percentage of elements 

of a column w of the M ST matrix has been put to 1 , then all the elements of the w-th column are fixed to 1 too inside 
the mask M STB , otherwise are all fixed to zero. The corresponding columns 1 ,...,Wof the H LS matrix are taken as 
useful or zeroed consequently. In that the masked/unmasked elements of the H LS matrix are always referred to 
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the same time instant w on all the M antennas, in accordance with the Temporal Masking definition. The prede- 
termined percentage of useful elements is selected according to the criteria stated in the introduction. Fig.8 shows 
that spatial unconstraint of the relevant samples having the same delay allows to catch a large amount of the 
energy associated to this delay. Variations around the threshold of 30% are possible in concordance with prelim- 
inary analysis of the channel, 
k) Temporal merging: for each element 

of the M ST matrix fixed at 1 , the elements of the previous 

and next 



\Msr{m.w+i)) 

are both fixed at 1 . That because, in case of temporal correlation of the channel, the samples preceding and 
successive a useful sample are very likely to be useful too. Consequently if columns w-1 and w+1 of the M ST matrix 
were set to 0 in the preceding step j) all the elements of these columns are turned to 1 in the mask M STB , in order 
to extend of a 2w chip-time amount the two sides of the unmasked bands. The final result is the Z STB matrix visible 
in fig.7. The usefulness of this step is recognized when recovery of timing is not very accurate. Figures 2, 3, and 
5 shows that inside a single cluster the time dispersion is limited, so that inside a single cluster the various samples 
are time correlated. The extension of the unmasked band reduces the probability to consider as noise some useful 
samples. 

[0063] At the end of the two empiric steps j) and k) matrix Z STB is completed, thanks to the M STB mask. For brevity, 
the following arguments are indifferently referred to the H LS and H STB matrixes. The aligned Figures 9, 7a, and 7b 
summarize these two steps for closely spaced antennas. In FigJa the black areas are associated to estimated samples 

|h LS/ ] 2 of the upper fig.9 classified as useful. With reference to the letters a, f, the comparison between fig.7a and 

the lower fig.7b indicates the modifications introduced by steps j) and k). More precisely, the aforementioned step j) 
operates in the zones a, d, f while step k) operates in the zones b, c. 

[0064] Operatively, the H STB matrix can be obtained by masking the H LS matrix through a complete fA {k) srd M x 
W x W) masking matrix for the user. The masking matrix M ( ) (M x W x W) is decomposable into S /CP diagonal 
masking matrices J (/c) WxW), each referred to a respective m frt ^rltenna and therefore to a line vector h( m > of the H 
matrix. Each elemenTbn the main diagonal of (W x l/V) is 1 if the corresponding element is selected and 0 
on the contrary, and the remaining elements arefSfl zero. Each masked line vector h* m) (1 x W) of the H LSB (M x W) 
band matrix is obtained as in the following: LSB 

^s ) =J!X (m) . Eq-ao 

[0065] Spatial pre-whitening and ail the preceding steps a) to k) shall be repeated for all the K active users. At the 
end an overall mask for all the K users is: 



M Tar - U M $ = [m (1 \M ( V*,M (/ °] Eq.31 
which multiplies the overall H L$ (M x Wx K) channel matrix. 
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[0066] Before introducing some useful variants and disclosing the receiver, with reference to the Figures 31 and 32 
a theoretical analysis on the MSE performance of the classification algorithm for Rayleigh mixture is developed. In 
such a case the loss of optimality of the threshold 

iteratively estimated to minimize the probability of incorrect classification, with respect to the threshold that minimizes 
the MSE, can be evaluated analytically. Let us consider a large sample size so that (asymptotically) the MSE terms in 
Eq. 9 can be evaluated from the expectations over the single sample: 

MSE H = P m x E|_ \H 2 \h + n|<Sj Eq. 32 

MSE 6h = P fe x E L N 2 |/t|< Sj Eq. 32a 

expectations E[ ] are for the conditional pdf's; P fe , P & and P m denote the probability of a false alarm, detection, and 
missing, respectively. The error MSE(S) = MSE H (S) + MSE6 H (S) can be evaluated analytically from Eq. 32 and Eq. 
32a as function of the threshold S. Fig.31 shows the MSE(S) normalized to the mean channel norm (i.e. pa 2 ) versus 
Sfor Rayleigh mixture with varying a ft /a 2 = {6, 10, 20} dB and p = {0,1 , 0,35}. The threshold S MSE that minimizes the 
MSE(S) is indicated on each plot together with the threshold 

obtained by the classification algorithm A. It can be noticed that for large o^/o^ (or equivalent^ for large signal to noise 
ratios, as for uncorrelated noise a 2 = a 2 / A/) 

while for small o 2 /o 2 it is 
h o 

s{p t a 2 Xi &l)<S MSE 

with a slight loss of performance. Fig.32 shows some plots of the optimum energy fraction i\ that minimizes the MSE 
for varying distribution parameters 

The purpose of fig.32 is to confirm the conclusions drawn in the introduction about he known estimation method based 
on fixed energy fraction t), taken as discriminating threshold for selecting a subset of the channel impulse response 
samples h. The energy fraction tj that minimizes the MSE depends on the statistical distribution of the channel samples 
and it can be easily related to the optimum threshold S MSE . Analysis in Fig.32 shows that the energy fraction r\ that 
minimizes the MSE cannot be fixed but it should be selected adaptively according to the channel distribution parameters 

{p. crllel). 
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[0067] In the simulation concerning the Figures 33 and 34 the performances of the Temporal Masking channel 
estimate of the present invention (LS+TM) are compared with the LS estimate and the fixed energy threshold r\. The 
normalized MSE 



IAH| ! M) 



versus the signal-to-noise ratio 



10 



15 



20 



25 



35 



40 



SNR 



is shown for GTU channel. In f ig.33 the Gaussian noise is spatially uncorrelated, while in f ig.34 is spatially correlated as: 

[R«L=^{0,9e—r 

and the angle a of arrival of the interference is random within the support [-rc/3, n/3J. In both the Figures the adaptive 
threshold (LS+TM) outperforms the LS estimate of approximately 5 dB in SNR and it is uniformly better than fixed 
energy threshold rj. The simulations here show that the energy threshold r\ should be increased with the SNR, similarly 
to fig.32. 

DESCRIPTION OF ALTERNATIVE EMBODIMENTS OF THE CHANNEL ESTIMATION 

[0068] In addition to the first variant of the invention, referred to the use of the classification algorithm B, the following 
30 further variants are disclosed referred (for the sake of brevity) to the only classification algorithm A, although they can 
be easily adjusted to include the classification algorithm B: 

Embodiment for closely spaced antennas (directive arrays). 

Embodiment for time slotted CDMA/GSM systems exploiting a multislot algorithm. This variant is 
referred as: "extended-multislot and extended-antennas". 

Embodiment for time slotted CDMA/GSM systems exploiting multislot algorithm. This variant is re- 
ferred as: "compact-multislot and extended-antennas'. 

Embodiment for time slotted CDMA/GSM systems exploiting multislot algorithm. This variant is re- 
ferred as: "extended-multislot and compact-antennas". 

Embodiment for time slotted CDMA/GSM systems exploiting multislot algorithm. This variant is re- 
ferred as: "compact-multislot and compact-antennas". 
Embodiment for adaptive-Reduced- Rank followed by Temporal Masking. 

[0069] With reference to the 2-th variant, when the antennas are closely spaced the channel taps are spatially 
45 correlated. In a multipath model where each path is chip-spaced with one direction of arrival, the S-T channel values 
differs only by a linearly increasing phase shift: 



2-th 


variant - 


3-th 


variant - 


4-th 


variant - 


5-th 


variant - 


6-th 


variant - 


7-th 


variant - 



50 



n m,w = h \,w e 



Eq. 33 



The selection variable z can be modified neglecting this phase shift by summing up on all the antennas the power of 
the actual z elements: 



55 



Eq.34 
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[0070] The conditional pdf of power variable d w is a non-central chi-squared |x 2 ) with 2M degrees of freedom de- 
pending on the hypothesis H 0 or H, holds; the non-central parameter is A^wf^ . Eq. 22 for determining threshold S 
shall be modified consequently. The under-parameterization method already described for the first embodiment of the 
invention is modified accordingly using the same steps of the previous variant. In particular the two variances are: 

5 

^i 2= iZ^ and a] 

10 

the W elements of the sorted set {d w } is: 

d„>S=>wev^z™ M:w = zfJ M Mw 

15 

d w <S=>w$V=>z°™ U;w =0; 

STM 

where z is the unmasked/masked column. The criterion of above is equivalent to the following operating on the 
20 M ST masK'rrlatrix visible in fig. 7a: 



— 7 \ The criterion for classifying 

M\W -L) W tv 



m,w<=V=>M ST{ , ^-1 

m,w€V=>M ST{ i M;w) =0. 

[0071] Similar to previous variant, the experimented number of iterations is less than ten. The last value of L deter- 
mines the first L elements of the sorted set {dj to be considered as useful at the end of classification and hence the 
so content of the M ST mask matrix. At this point of the procedure the space merging of the preceding step j) is unnecessary 
because intrinsic in the definition of variable cy Eq. 34). Preceding step k) is performed for extending, if necessary, a 
chip time wthe two sides of the unmasked bands. 

[0072] Figures 35a, 35b, 36a, and 36b are referred to respectively 3-th, 4-th, 5-th, and 6-th multistat variants, 
whose aim is that to improve the performance of the proposed classification algorithms thanks to the use of a wider 

35 statistical sample for estimating the involved statistical parameters (a 2 a 2 , p). In that the convergence of the classifi- 
cation algorithms is sped up and a possible influence on the final results from the arbitrary choice of the initial probability 
p is eliminated. Popular examples of slotted cellular systems are the GSM/GPRS and the various proposed UMTS/ 
CDMA like TDD-UTRA or CWTS. GSM exploits sequential frames 4,615 ms long, each frame includes 8-timeslots 
assigned to the users one to one. TDD-UTRA frames are 10 ms long and include 15 timeslots freely assignable in 

40 uplink or downlink, with the restriction to have at least one timeslot for each direction. The premise of all the multistat 
variants is that some channel characteristics influencing the estimate remain substantially unchanged during a time 
window spanning a reasonably number of frames. Relevant invariable characteristics are channel delay and attenuation 
profile (not fading which can be variable). The number of permissible slots mainly depends on the mobile speed. The 
length of the time window is selected in order to prevent one chip sliding of the channel delays due to the mobile speed. 

45 From the Doppler theory about cellular system has been found that with a speed of 300 km/h and tolerating a time slip 
of half-chip duration (Tc = 260 ns for TDD-UTRA), the correlative windows reaches 360 ms, to say 36 frames. As far 
as the most suitable statistical distributions concerns (and the homonym pdf-s) a Rayleigh pdf is used if the uncorrelated 
elements are not summed up along space-time coordinates, instead, when they are summed up the chi-squared is 
needed. Difficulties to estimate the threshold S in closed form are circumvented from the use of an Erlang distribution 

so in conjunction with an exponential decreasing power profile for the channel samples. A most general approach makes 
use of a gamma distribution from which chi-squared and Erlang distribution both descend. Theory of these distribution 
functions are given in the volume "Probability, Random Variables and Stochastic Processes", author A. Populis, Mc- 
Graw-Hill International Editions, Electrical & Engineering Series 1991 [Ref 7]. 

[0073] Fig. 35a shows the 3-th variant suitable for "extended-multislot and extended-antennas" which operates on 
55 an extended set Mx Wx NS dimensioned, being NS the number of time slots involved in the procedure. With reference 
to the fig.35a, let NS be the number of time slots involved in the procedure, a spatial pre-whitening of the channel 
matrix H is performed as in the following: 



23 



EP 1 300 999 A1 



Rns-overcge " Z XTQ ' 35 



z - H n$-avemg9™ Ec T »0 

the number A/S is off-line determined from the observation of the stationariness of the channel and the maximum 
permissible mobile speed. A memory table is provided to store A/S consecutive Z<*> matrices pre-whitened as in the 
Eq. 10" constituting the input of the classification algorithm. The memory table could be, for example, a line vector G< k ) 
(1, MxWxNS dimensioned) including A/S serialized Z<*> matrices. After which the method for under-parameterizing 
the channel estimate already described for the first embodiment of the invention is modified for the 3-th variant as in 
the following (steps a) to i) are equally labeled): 

a) sorting the vector G(k) by decreasing values of the samples z m wns (w='\ IV; m =1 t ...,M; ns = 1 A/S). In 

order to simplify the next steps two vectors named (O w ) and I A (1, MxWx A/S dimensioned) are introduced. 
Vector 



(o<»JW,{ 2 L.J. 



vector K*) includes the indexes of the elements stored in (0**b l. Finally a multiple mask matrix M( k ) S j. multis!ot (M, 
WxNS dimensioned) that corresponds to NS successive M matrices like the one visible in fig.7a. More 
precisely: > 

([-NS)Tburst 

M^Lultisiot = U M£> w Eq.36 

ns-Tburst 

is a matrix of all zeroes except for some 1s in the (m, w, ns) locations corresponding to the locations stored in the 
first L elements of the vector |W. 

b) assigning an arbitrary probability p = p /n ; 

c) calculating the dimension L = p }n x M x W x A/S of the useful set that for now includes the first L elements of 
the vector 



d) estimating: 



(OMJ 1 ; 



L» m,w t nseV 



which is equivalent to 



1 t< 
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15 



25 



35 



40 



50 



referred to the vector O 2 ; 
e) estimating: 



0 (MxWxNS-L) m ,~*v 



which is equivalent to 



1 MxWxNS 

0 (MxWxNS-L) 



referred to the vector O 2 ; 
f) estimating a threshold 



at the iteration (/) from Eq. 22; 

g) classifying the M x Wx NS samples of the sorted set 



30 belonging to the respective vectors 



|(*) by enforcing the criterion: 



A 



STM 



Zm, w ,ns ZS^m,w,ns£V=> z*™ ns = 0; 



where z STM is the unmasked/masked sample. The criterion of abovels equivalent to the following operating on 
45 the M( k ) S T-muitisiot mas k matrix (omitting index k for brevity): 

m,w,ns S V=> M ST ^ ultislot{m wns) = 1 



m,w,ns £ V=> M ST _ multistot{m wns) =0 



The elements of the mask to be put at 1 are the ones addressed by the first L elements of the |W vector; 
h) calculating lS»V as the new dimension of the set V; 
55 i) comparing L< /+1 > with the preceding L<'> and stopping the classification if Z_(» 1 )= L(0 otherwise estimating: 

p (,+1) = L {M) IM X WX NS 
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and repeating the previous steps from d) to [) cyclically until the end condition is met. An equivalent way to stop 
the iterations is that to check the condition S^ 1 ) = SO at the step f). The experimented number of iterations is 
lesser than ten. At the end of the classification the set V includes the first L elements of the vector 

while the set V includes the remaining (M x Wx NS - L) elements. The content of the M ST mask matrix is con- 
sequently determined. 

j) Deep Temporal merging: this step of the multislot procedure exploits a further opportunity in respect of the 
single-slot embodiment consisting of a deep temporal merging of the NS masks composing the M ST4nuMistot matrix. 
The purpose of this step is that to collapse the M ST . muftis , 0 j matrix into a most reliable M ST mask valid for the actual 
masking period, to be submitted to the successive steps of the procedure. Operatively a mask M SUM (for the 
user) (Mx W dimensioned) is calculated at first summing up all the corresponding NS elements (m, w f ns) of the 
NS M ST masks composing the M ST _ multislot matrix as: 



^SUM 

M 

{m m (m.w)} 



where: j = w + ns x W. 
Secondly, the actual M ST mask matrix 

{m ST (m,w)} 

is obtained introducing a discretional probability p deep , for example: p deep at least 0,25 x A/S, such as if: 

{m SUM (m, w)} > p deep xNS => {m ST (m, w)} = 1 Eq. 38 

whose explanation is that in any case communication takes place. 

k) Spatial merging: merging is performed on the actual M SUM mask matrix as the equivalent step j) of the first 
embodiment of the invention performs on the M sx mask matrix (figJa). 

[0074] Temporal merging at step k) of the first embodiment of the invention is proved to be unnecessary for all the 
multislot procedures. 

[0075] All the considerations about the final masking operations for obtaining the H STB channel band matrix for all 
the K users are still valid. Masking cadence can be selected either slot-by-slot or block-by-block. The multislot procedure 
after an initial delay of NS x T^t reaches the steady state; the delay allows a best approximation of the space-time 
distribution of the energy by means of the actual masking. When slot-by-slot cadence is selected the H STB matrix valid 
for the actual burst has built exploiting the preceding (A/S-1 ) T bursP while if block-by-block cadence is selected the H STB 
matrix completed at the end of the preceding block is valid for the whole duration of the actual block. It can also be 
appreciate that the singe-slot procedure disclosed for the first embodiment of the invention can be obtained as a par- 
ticular case of the multislot procedure for NS = 1. 

[0076] Fig. 35b shows the 4-th variant for "compact-multislot and extended-antennas" which operates on a compact 
set M x W dimensioned. With reference to the fig.35b, this variant is derived introducing the following modifications 
in the preceding 3-th variant: 
a new classification variable 



Rum( W ' VV )} 
{raST-muIfclolO 71 './)} 

Eq.37 

: 2 J m ST-multisloA W Wj' 
ns=0 
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d = Yz 2 

m.w La m.w,ns 
ns=\ 



is determined at first from the variable z which follows a Rayleigh distribution 



w 



z ~ R(z,at). 

In its turn z 2 is reasonably assumed to have an exponential decreasing power profile 



15 



» ( ^ 



20 



The sum d m w of A/S random variables z 2 each with exponential pdf is an Erlang distribution: 



25 



d m . v - EH 



NS, 



i J 



the pdf of the random variable dis: 



30 



35 



• vector 



1(d) = 



(A/S-1)! 



e , 



Eq. 39 



40 



45 



and corresponding It*) are obtained consequently; 

the dimension of the useful set is g now L = px M x IrV and that of the noise set (Mx W-L) and the two variances 
are estimated from vector ((O w ) consequently; 

multiple mask matrix of Eq. 36 is unnecessary and the M w mask of Eq. 37 is directly determined; 
threshold SUM 



50 



is estimated from the following expression: 



55 



s(crl<7lp)= 



1 1 



v-1 



In 



_2 



I"? 



Eq. 22a 
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10 



15 



obtained minimizing the probability of uncorrect classification for Eriang pdf-s; 

deep temporal merging at step j) is unnecessary, but spatial merging at step k) is performed on the mask 
for obtaining the M w band mask. 



SUM 



STB 



[0077] Fig.36a shows the 5-th variant for "extended-multislot and compact-antennas" which operates on a compact 
set WxNS dimensioned. With reference to the fig.36a, this variant is derived introducing the following modifications 
inside the 3-th variant: 

• a new classification variable 



is determined at first; 
vector 



w,ns m,w,ns 
m=l 



20 



25 



and corresponding K*> are determined; 

the dimension of the useful set is now L = px Wx NS and that of the noise set (WxNS-L), the two variances are 

estimated from vector (O w ) consequently; 

multiple mask matrix (Eq. 36) is now a multiple mask vector; 

threshold 



30 



35 



40 



is estimated from the following expression: 



S{al,<rf,p)= 




Eq. 22b 



still obtained minimizing the probability of uncorrect classification for the Eriang distribution 



45 



d ~ Erl 



deep temporal merging of step j) is performed on the multiple mask vector for obtaining a cumulative mask vector 
similarly to Eq. 37; 

spatial merging step k) is unnecessary and the cumulative mask vector is mapped into the M <k> band mask. 



STB 



[0078] Fig. 36b shows the 6-th variant for "compact-multislot and compact-antennas" which operates on a super- 
compact set W elements long. With reference to the fig.36b, this variant is derived introducing the following modifica- 
55 tions inside the 3-th variant: 



• a new classification variable 
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M NS 

d =Y5V 

is determined at first; 
• vector 



(0"J =sort{d w J 

and corresponding |W are determined; 

• multiple mask matrix of Eq. 36 is unnecessary and the and the M w mask of Eq. 37 is now a cumulative mask 
vector; SUM 

• the dimension of tlje useful set is now L=px Wand that of the noise set (W - L), the two variances are estimated 
from vector (O w ) consequently; 

• threshold 



is estimated from the following expression: 



-i 



In 



(M*NS) 



l~P 



Eq. 22c 



still obtained minimizing the probability of uncorrect classification for the Erlang distribution 



\ °i J 

• deep temporal merging j) and spatial merging k) are both unnecessary and the cumulative mask vector is mapped 
onto the tA^ k) band mask.. 

STB 

[0079] The 7-th variant, adds up the teaching of the preceding adaptive-RR patent application [REF. 2] and that of 
the first embodiment of the present invention, but it can be immediately extended to all the preceding variants. The 
already introduced arguments on the adaptive-RR method further integrated with the following ones meet the sufficient 
description requirements for the relevant part of the combination. More precisely: 

• Estimation of the H LS channel matrix is well known in the art. 

• Since the user can be considered as decoupled, the H LS channel matrix is spatial and temporal pre-whitened for 
all the K users separately, similarly to Eq. 10 (or Eq. 10'). 

• Matrix ZW is approximated by the low-dimensional channel A^B^, where 

are r k orthonormal whitened beamformers and 
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B;=k,-.bij=(x»xfr 



5 B k are r k orthogonal whitened FIR filters, so that 

Z {k) ~A k B k H holds. Eq.40 

*0 • The r k -dimensional spatio-temporal components (Reduced Rank) is obtained by minimizing the error of the ap- 
proximation: 

A; ,B, = arg mm \z k - A' k B'? \ p Eq. 41 

in the Frobenius norm. The optimization is known to be given starting from defining the following matrixes: 



V* = Z*Z". torn = 1 .-/•* Eq.42 



U*=eigv rt {\v} Eq.43 



A 



where eigv rk where denotes the Mx r k matrix of the r k leading eigenvectors of the matrix Whitened estimated A , 



are given by: 



Afc-Ofc Eq.44 



30 = " k 

B; = (X^) 1/2 Z H ^ 2 U fc Eq.45 

35 • The optimum criterion stated in Eq. 41 cannot be carried out in practice, therefore r k is obtained according to the 
information theory criterion based on the Kullback-Leibler information measure for the estimation of the number 
of independent sources impinging on a uniform linear array, as proposed in the article [REF. 8] of M. Wax, I. Ziskind, 
"Detection of the number of coherent signals by the MDL principle", IEEE Trans. Acoustic Speech and Signal 
Processing, Vol. 37, pp.1190-1196, August 1989, which is incorporated by reference. The information theory cri- 

40 terion is based on the Minimum Description Length (MDL), applied in the present case for the description of the 

temporal covariance matrix of the full-rank channel matrix Z k relative to the user. The MDL algorithm for the 
selection of the optimum r k avails of the following steps: 

1 . define a vector L (M x 1 dimensioned) of null elements; 
45 2. compute a matrix T = Z*z" and its eigenvalues \j - (1 =1 ,...,M); 

3. execute for p=0,...,M-1 
4. 

50 



55 
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L{p + \) = -W(m-p)log 



Eq.46 

5. minimum of the vector L -» (pos, vaf) (position inside the vector, minimum value); 

6. if (pos-1 =0) then r k = pos-1 ; r k is the output of the MDL algorithm (repeated for each user). 

• The matrix Z k (Eq. 40) obtained as above is then submitted to the first embodiment of the invention, or its variants 
1 to 4, to benefit of the further complexity reduction. At the end the band channel matrix H STB of fig.7 is obtained. 

[0080] Now, with reference to the remaining Figures 37 to 41 , a Base Station receiver suitable for radiomobile Base 
Stations exploiting a set of M antennas is disclosed. The receiver depicted in fig.39 includes a channel estimator based 
on the Temporal Masking under-parameterization method already disclosed and a Data Detector able to exploit the 
resulting reduced-complexity channel matrix. The Base Station receiver of fig.39 is valid for CDMA systems but can 
be easily arranged for the GSM. 

[0081] Fig.37 shows a discrete time model of a M-antennas CDMA system of the prior art modeling from the trans- 
mitted data to the received sequences. The arguments developed for illustrating the model of fig.37 allow to define a 
matrix A visible in fig.38b and known in the art. The aim matrix A is that of supporting data detection. For the case of 
the present invention a new Matrix A STB , partially visible in fig.38c, assumes the same structure as the matrix A but 
with a modified content which results from the use of a Temporally Masked channel matrix H STB . The new matrix A STB 
supports the operation of the Data Detector belonging to the radiomobile Base Station receiver of fig.39. 
[0082] With reference to the fig.37 the overall CDMA system is composed of M identical sub-models SBM-1, 
SBM-M each relevant to the links between all the K users and a m th antenna, so that all the possible M x K links 
between the K users and the M antennas are considered by the model. Each sub-model SBM -1 , .... SBM -M includes 
K cascades of: 

• an 1 :Q oversampler of the datum d k) transmitted from the tf* user; 

• a block 6 k) including the OVSF k (drthogonal Variable Spreading Factor) code sequence for the lt h user; 

• a block tF k,m ) including the relevant channel impulse response previously estimated. 

n 

[0083] Each cascade outputs a respective sequence st k,m ^ directed to an input of a digital adder ADU( m > which 
generates a summation sequence S* m) for the transmissionVom the K transmitters to the m th antenna. A second adder 
ADN( m > adds a Gaussian noise n^Xo the sequence s <m) obtaining a more realistic reception sequence e (m) for the 
aforementioned l|nks. A)i the M reception sequences e (l *, ... e (M) reach a DATA DETECTOR block that joinfly detects 
the original data </V»,c/ K) . n n 

[0084] Thanks to its modular structure a detailed description of the only SBM-1 sub-model can be easily extended 
to the whole system. With reference to the SBM-1 block, tr\ cP\— ,(/ k \- ,c/ K) represent data transmitted at symbol 
time T s from as many K users in one semi-burst Data blocR 1(2? of fig.^8. Each user transmits a sequence of data: 

dP = ld»4».~.<tf} T M,.-.K Eq.47 

constituted by N symbols; each of them is expanded by a Q-chips code: 

c <k) =[c 1 <*>,4*>,-,c 0 <*»J r *.1,...K Eq.48 

different from each user. Expansion is obtained by firstly oversampling of a Q factor each symbol c/** and then DS-CD- 
MA (Direct Sequences CDMA) modulating the oversampled symbol with the respective OVSF (brthogonal Variable 
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Spreading Factor) code sequence d*). The expansion can also be seen as the cascade of a 1 :Q oversampler and a 
FIR filter whose impulse response is just the sequence c(*>. The propagation towards the antenna 1 of the BTS is 
schematized with the impulse response: 

= [/,,<*■'>. hW>,», h w ik -Y *- 1. -,K Eq. 49 

constituted of W samples at chip time T c including the combination of the transmission and reception filters (radix of 
raised cosine frequency responses) and the channel h(ty k >V. Eq. 49, extended to consider all the M antennas, gives 
10 rise to a partition of the H LS channel matrix into M sub-matrixes H* m) like the one (fig.37). 

[0085] OVSF code and channel combination takes to the defirAfion of a linear system at chip time T c having the 
following time response: 
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„<M) = ^M..., b W^ ]T= C (K . h ,*.1) k m , ... K Eq 5Q 

where * is the convolution operator. The received sequence: 

e -ie 1 ,e 2 , »0/v.o+iv-iJ tq. oi 

results as the summation of Asynchronous sequences sMi) each A = N x 0+ W- 1 long, perturbed by an additive 
AWGN noise: 

n =[n t , n 2 .*",n N o +lv . 1 J Eq. 52 

with covariance matrix: 



R t (,) = 4n (,) n (,w )]=a 2 I A . Eq.53 

35 [0086] The signal 

40 Jt=l 

received at chip time on the antenna 1 relatively to all the K users can be written as : 

« e- (,) = A (1) -d + n (,) Eq.54 

having indicated with d the reorganization of the overall data vector for the K users into N blocks of K symbols each 
(sorted by symbol 1,....,A/) as in the following: 



and with 
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the signature matrix that includes, for all the N chips, the convolution obtained by vectors b^ 1 ) in the following way: 

a O(n-1)+ UK (n-1)+k " D l tc 4- ^ 

per fr = 1 ," ,/<; n= V-.Aft /= 1-GfW-l and null elsewhere. 

[0087] The discrete channel model for the link between the user and ail the M antennas can be represented by 
a channel vectors HW including the M channel vectors: 

H ( *' = [h"' ,, 1 hM,-,h M ,-mh^] T k = 1,-./C 

hlML^jM)^".)...^.)^ *=1.--.K e m = 1,..,M. Eq.57 

[0088] The combination between the spreading code c(*> and the channel h(*- m > results: 

B<"> = [b^K b<*» ~, b<*' m \-,b <W V k = 1 ,-.K 

b (*. m) = [b (*.m)_ b ^>,.,t(W/ = c l*). h (*.»» *= V.K; fll-l.-.M Eq.58 

[0089] The sequences received at chip time on all the M antennas relatively to all the K users are: 

E = [e (1) ,e (2) ,-.e<' n >,",e (M) ] r 

e {m) = le\ m \t{ m \~-,b™ +w ./ Eq.59 
and they result as the summation of /(sequences each A = /vxQ + l/V-1 long, perturbed by an additive AWGN noise: 

N=[n< 1 >,n< 2 »,---:n ( '">n ( *V M.-.K 

n (m » = [n ( 1 m) ,n< m >,-,n^ +kv . 1 ] r ; Eq.60 

40 

each sequence n( m ) has a space covariance matrix: 

R-,=k} ^^"^l Eq.61 

The received signal e< m > can be generalized from the Eq. 54 valid for signal e< 1 ) as: 

50 e l(m) =A (m) -d + n (m> Eq.62 

where A< m ) is the generalization to the mt h antenna of the signature matrix At 1 ) defined in Eq. 56: 

in such a case Eq. 58 is modified to express the combination between the spreading codes cW of ail the K users and 
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the channel h^« m ) as: 

where b(^ m ) is defined as in the Eq.58. Matrix B( m > of Eq. 64 is visible in fig.38a. Introducing the further definitions: 

e=[e< , > r ,e (2)r ,-,e (n ' )T 1 - e <M)7 ]r, Eq.66 

A=[A (1 »V 2)T ,-.A (m > T ,-,A(M> r ] T , Eq.66 

n=[n< 1 ) T ,n (2 ' r ,-.n (n " r > -,n <M)T ] r , Eq.67 
the following overall system for the link between all the K users and all the M antennas can be written: 

20 e=A-d+n, Eq.68 
for the so reorganized sequences the space noise covariance matrix R 0 is: 

R„=4l-n"]=R,„<g>I A . Eq.69 

The structure of the overall signature Matrix A of Eq. 66 is visible in fig.38b. 
30 [0090] The NxQx M sub-matrixes B< m ) of f ig.38a of the prior art are unmasked (so as the overall signature matrix 
A) and the complexity of the detection is not reduced consequently. On the contrary, for data detection in presence of 
the band masked channel matrix H STB of the present invention a masked signature matrix A STB is provided, as shown 
in fig.38c. Matrix A STB is modular like matrix A, each module includes Nx OB (m) masked sub-matrixes like those 
visible in f ig.38d. An m th B (m) sub-matrix is defined as in Eq. 64: 
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STB 

a( m ) _ rh< 1 ''") h< 2 ' m > ... h ( *' m > ...h (K ' m) i r U - 1 ... K • 
D $TB-l D $TB > D $TB • > D S7B » D STB * K ~ '» » A ' 



and can be obtained from a sub-matrix like the H (1 ^ of f ig.37. The b^'"* vector is forced to assume a time masking 
40 like that of the rrt h line of the H* ^ matrix^relevant for L ine tf h user. For fxample, b {k ' m) reflects the shape of the band 
matrix visible in fig.7. Furthermore, because of the preceding band masking, the b**^ vector (m = 1 M) relevant to 
a k? h user has the same time structure for all the M antennas, so as all the b^ m ^ vector drawn up in colums in matrix 

STB 

A STB have the same time structure. This is the final effect on the data detection consequent to have Band Masked the 
H ls channel estimate. Matrix A STB enters an overall system like that of Eq. 68 for the detection of the transmitted data 
45 sequences by the receiver; the final system is: 

e = A srs • d + n. Eq. 70 

so in which the overall space noise covariance matrix of noise n shall be re-estimated in view of the Space-Time masked 
channel H STB . Maintaining the same denomination R„ and indicating with R' ns the re-estimated R ns , Eq. 69 becomes: 



R„= J £:[nn // ]=R, u ®I A . Eq.69" 

[0091] The radiomobile Base Station Receiver of fig.39 includes the following main blocks: 
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• M identical BB-EXT blocks; 

• a complexity-reduction channel estimator block COMP-RED; 

• M identically structured signature blocks SIGN-1 , SIGN-M; 

• a ZF-BLE DATA DETECTOR block. 

5 

[0092] Generally speaking the names of the blocks indicate their functions. The M identical BB-EXT blocks are in- 
terposed between as many antennas 1, 2, M composing the set and both the COMP-RED and ZF-BLE DATA DE- 
TECTOR blocks. The channel estimator COMP-RED includes the following blocks: LS-CHEST, SPn-ES, MID, WST, 
RR (dashed), SORTER, CLASS, MASK, pdf-STIM, ADAPT-TRSH, MERGING, and BAND-MASKING. Block MID 

10 stores the K training sequences X transmitted in midamble position of the burst of fig. 1 8. The output of the channel 
estimator COMP-RED is the channel matrix H S7B sent to the input of all the M signature blocks SIGN. Each block 

SIGN-1 , SIGN-M includes a group of K memory blocks SPRD-1 , SPRD-K for storing as many OVSF k code 

c<*\ and a group of A/ convolution^ blocks CONV1 CONV-N (the K oversampler Q of fig.37 are omitted for brevity). 

Each signature block SIGN-1 f , SIGN-M receives a respective H (m) sub-matrix of the H S7B matrix channel relevant 

15 to the m th antenna; the H (m) sub-matrix is forwarded to the group s of N blocks CONV1 , , CONV-N, each of them 

performing a replica of the convolution between the K codes cM and the H* m * sub-matrix obtaining an m th partition of 
the overall signature matrix A STB (Figures 38c and 38d) coming to the blocC^F-BLE DATA DETECTOR. In case the 
receiver of fig.39 has to be arranged for a GSM (or GSM-derived) system the M blocks SIGN don't include the SPRD 
and CONV groups but the structure of the new A STB matrix is immediately referable to the preceding one. 

20 [0093] As far as the operation of the channel estimator block COMP-RED concerns, each block BB-EXT provides to: 

• correctly receive the RF signals transmitted from all the K users; 

• RF filtering and down converting the overall received signal; 

• Q-PSK demodulating, sampling and A/D converting the base band signal; 

25 • base band filtering with a digital filter matched to the chip obtaining the base band digital samples Y, e corresponding 
to the K transmitted bursts of fig. 18 received on all the M antennas, in correspondence of the Training sequence 
and the two Data blocks, respectively. 

[0094] Elements of Y and e matrix are stored in the relevant blocks BB-EXTs to be processed by the successive 
30 blocks COMP-RED and ZF-BLE DATA DETECTOR, respectively. Both matrix Y and the corresponding stored midamble 
matrix X reach the input of LS-CHEST block for estimating the full-complexity channel matrix H LS with Eq. 4 directly, 
or using some alternative known method. Matrices Y, X and H LS also reach the inputs of SPn-ES block which estimates 
the space noise R ns matrix by means of Eq. 5. Matrices X, H LS and R n3 are inputted to the WST block for spatially (and 
in case also temporally) pre-whitening the LS channel matrix H LS , as in Eq. 10' (or in Eq. 10), obtaining the whitened 
35 channel matrix Z to be complexity reduced. Downstream dashed block RR only concerns the 7-th variant and takes 
care of the adaptive rank reduction of the Z channel matrix. The remaining blocks of COMP-RED perform the classi- 
fication algorithm of the first embodiment of the method invention and its variants. More precisely: 

• block SORTER sorts by decreasing values the elements z of the Z matrix and provides them to the cascaded block 
40 CLASS; 

A 

• block CLASS receives a threshold S from block ADAPT-TRSH and classifies the sorted elements z in order to fill 
the set Vand V under the hypotheses H t and H 0 , respectively; 

A Ap Ao 

45 • block pdf-STIM receives the classified elements z in orj^er to estimate the pdf parameters , a Q sent to the 
block ADAPT-TRSH, together with an initial probability p irp in order to estimate the threshold S; 

• block MASK receives from the upstream block CLASS the final classified elements z for generating the space- 
time mask matrix M ST of fig.7a; 

50 

• block MERGING receives the matrix M ST from the preceding block MASK and performs the spatial and temporal 
merging in order to obtain the band mask matrix M STB of f ig.7b; 

• block BAND-MASKING receives both the M STB mask matrix from the preceding block MERGING and the channel 
55 matrix H LS from the LS-CHEST block, in order to band masking the matrix H LS and obtaining the output matrix 

H STB sent to the block SIGN and the SPn-ES block. This last re-estimates a new space noise covariance matrix 
R' ns using the H STB matrix in as in the following: 
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R ns = R >y* H STB R xy E Q- 5 ' 

[0095] Block ZF-BLE DATA DETECTOR is a so-called "Zero- Forcing Block Linear Equalizer" which is a data decor- 
relator known in the art (except for the particular structure of the A STB matrix). In case of CDMA systems the data 
detector block operates on the expression e = A STB ■ d + n (Eq. 70) minimizing the quadratic form: 

(e - A OT • df R'J (e - A m • d) Eq. 71 

whose aim is that of eliminating both I SI (InterSymbol Interference) and MAI interference without considering the noise 
level. The solution of Eq. 71 is: 

<H ^STB^'n A S7B) 1 - A STB R *n e E Q- 72 



suitable for noise levels not excessively low. The noise covariance matrix R* 1 is that of Eq. 69*. The particular structure 
of Eq. 72 indicates the presence of two cascaded blocks, namely a Whiten?ng Matched Filter (WMF) A* R" 1 acting 
20 on the reception signal e for estimating data named d MF at this stage, and a second block (A^ w R n A^)* 1 for 
canceling IS I and MAI from the d MF data. 

[0096] A valid alternative to the ZF-BLE DATA DETECTOR block is a so called "Minimum Mean Square Error - Block 
Linear Equalizer" (MMSE-BLE), referred to the result of the expression: 



A A 

(d-d) (d-d) Eq. 73 

which is the expected MSE value on the data estimation. The solution of the minimization problem leads to: 

d = (Astb R n A STB + R'J)" 1 AsTB R n © Ec l- 74 

with 



R d =E[dd H ]. 



The MMSE-BLE equalizer doesnt entirely eliminate ISl and MAI interference except for infinite SNR, nonetheless it is 
known in the art that BER at the output of this is equalizer is lower than ZF-BLE for all the noise levels and number of 
users. 

[0097] Looking at the content of the A STB matrix it can be appreciate the great deal of null elements which strongly 
simplify the data detection and improve the performance of the receiver, as shown in the Figures 40 and 41. The two 
Figures are referred to comparative simulations of the receiver of the present invention, complete of channel estimation 
and S-T multi-user detection, faced to a GTU channel. Performance is evaluated for uncorrected and correlated noise 
in a tri-sectorial cellular system. The correlated noise is generated by 3 intercell interferers randomly located within the 
120° support. Simulations involve both the statistical parameters of the GTU channel and the operation of the radio- 
mobile Base Station Receiver of f ig.39. The last is engaged in time slotted CDMA proposed for 3 rd generation system 
(TDD-UTRA) by UMTS standard ETSI: "Physical channels and mapping of transport channels onto physical channels 
(TDD), TS-125 221 , V3.3.0 2000-06; [RER 9]. Numerical results are for the following scenario: M = 8 antennas; K = 8 
active users randomly located within the tri-sectorial cell; channel length up to W = 57 chips; spreading code 0=16 
chips/symbol; chip shaped as squared raised cosine with roll-off 0.22; and chip rate of 3.84 Mchips/s (Traffic Burst 1 
of TDD-UTRA). The length of the data block is N b = 61 QPSK modulated symbols, training sequences are chosen 
according to standard specifications with length N m = 512. 

[0098] Ffg.40 shows the (average) performance for spatially uncorrelated noise in terms of BER for uncoded bits 
vs. SNR = E bi /N 0 . Fig.41 shows the (average) performance for spatially correlated noise in terms of BER vs. SIR for 
3 intercell interferers. In the two Figures the BER for perfect channel state information (H, R ns ) is shown as reference. 
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The comparison with respect to the LS estimate of the S-T channel matrix indicates that the gain of the radiomobile 
Base Station Receiver of fig.39 (LS+TM) is approximately 4 + 6 dB for all the SNR/SIR values considered here. 



Claims 

1. Method for under-parameterizing the channel estimate in a radiomobile Base Station Receiver exploiting set of 
M-antennas for receiving user data bursts transmitted from one or more simultaneously active mobile stations on 
an assigned carrier, including the steps of: 

a) estimating, in correspondence of a training sequence included in each data burst and known to the receiver, 
the elements of a full-dimensional space-time channel model settled as a memory matrix table (H ls ) including 
the Wmost significant samples of the channel impulse response relevant to the M links insisting between the 
one or each mobile station and the M-antennas; 

b) estimating a spatial noise covariance matrix (R ns ) from the residual of the channel estimation, assuming as 
known the temporal correlation of the noise on the various channels, characterized in that includes the further 
steps of: 

c) filtering said full-dimensional space-time channel matrix (H LS ) by means of said spatial noise covariance 
matrix (R ns ) to make spatially uncorrelated, or spatially whitened, the noise on the various channels in order 
to describe the elements of said full-dimensional space-time channel matrix (H LS ) by means of a relevant 
statistical distribution of the amplitudes; 

d) classifying as useful or noise the elements of the whitened set (Z LS ) using a discriminating threshold (S) 
which is dynamically estimated in order to minimize the probability of uncorrect classification; 

e) zeroing the elements of the full-dimensional space-time channel matrix (H LS ) corresponding to the elements 
classified as unuseful. 

2. Method for under-parameterizing the channel estimate according to claim 2, characterized In that before the 
classification step d) said full-dimensional space-time channel matrix (H LS ) is additionally filtered by means of a 
correlation matrix of one or more uncorrelated training sequences assigned to one or more users one to one, in 
order to makes temporally uncorrelated, or temporally whitened, the channel estimation of each user. 

3. Method for under-parameterizing the channel estimate according to claim 1 or 2, characterized in that between 
the classification step d) and the zeroing step e) a further intermediate step 6) is introduced for calculating the 
percentage of the elements classified as useful relatively to the elements relevant to all the M antennas at the 
same discrete time instant and taking as useful too all the remaining elements of the whitened set (Z LS ) relevant 
to the same discrete time instant if said percentage is greater than a fixed value depending on the maximum 
distance between adjacent antennas, otherwise zeroing the useful elements contributing to said percentage. 

4. Method for under-parameterizing the channel estimate according to claim 3, characterized in that said fixed value 
of the percentage calculated at step 5) is at least 30%. 

5. Method for under-parameterizing the channel estimate according to claim 3 or 4, characterized in that step 5) is 
extended for taking as useful too all the elements of the full-dimensional space-time channel matrix (H LS ) relevant 
to discrete time instants immediately preceding and following an useful element, in order to match those channels 
introducing temporal correlation between adjacent impulses of the channel responses and neutralize possible 
unsynchronizations. 

6. Method for under-parameterizing the channel estimate according to one of the preceding claims, characterized 
in that the zeroing step e) is carried out through a mask matrix {M ST , M S7B ) with 1 in the position of the elements 
classified as useful and zero elsewhere. 

7. Method for under-parameterizing the channel estimate according to any claim from 3 to 6, characterized in that 
said full-dimensional space-time channel matrix (H LS ) includes as many W-eJements vectors (hW.—.hM) as the 
number of the M antennas and by effect of the zeroing step e) the resulting space-time channel matrix assumes 
a band configuration (H STB ) relevant to each simultaneously active user. 

8. Method for under-parameterizing the channel estimate according to one of the preceding claims, characterized 
in that the transmission carrier is assigned time division to the active users and under-parameterization is extended 
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on NS successive time slots at a time, the length NSoi the estimation window depending on the stationariness of 
the channel and/or the mobile speed. 

9. Method for under-parameterizing the channel estimate according to the preceding claim, characterized in that 
said spatial noise covariance matrix (R ns ) is averaged on said number A/Sof successive time slots and the average 
spatial noise covariance matrix is used for spatially filtering said full-dimensional space-time channel matrix (H LS ) 
at the filtering step c). 

10. Method for under-parameterizing the channel estimate according to the preceding claim when it depends on claim 
6, characterized in that said mask matrix (M ST , M STB ) for each active user is calculated by the following steps of: 



M SUM = (m sum (m,w)} 



^ST-multislot — ^ST-multUiotC^'^)} 

• calculating 



{m sum (m,w)} = Y,m ST . mMlot (m,j\ 

ns-Q 



j = w + ns x W 

where M ST _ mu , tislot corresponds to the union of NS successive mask matrix (M ST ,M S7B ) and m=1 m*=1 W; 
ns=1, -,A/S; 

• introducing a discretional probability p deep such as if: 

( m SUM ( m - W )} > Pdeep * NS => {m ST (w, w)} = 1 

being 

{m CT (m,w)} 

as many elements of said mask matrix (M ST > M srs ) valid for the actual time slot. 

11. Method for under-parameterizing the channel estimate according to the preceding claim, characterized in that 
said discretional probability p dB8p is at least 0,25 x NS. 

12. Method for under-parameterizing the channel estimate according to one of the preceding claims, characterized 
in that further includes the following steps of: 

• operating on the elements of the whitened set (Z LS ) in order to obtain a classification variable {z m w , z m wns , 
d mt w d Wf ns> d w ) spanning a relevant classification set; 

• arbitrarily assigning an initial probability (p in ) that an element of the classification set belong to the useful 
subset (H,); 

• determining a suitable mathematical expression for estimating said discriminating threshold (S) that minimizes 
the probability of uncorrect classification from the statistical distribution of the classification variable {z m w , 

z m,w t ns> ^m,w* ^w.ns' 
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• recursively executing the classification step d) for estimating the discriminating threshold (S) and the statistical 
parameters of the useful (fy) and noise (H 0 ) subset starting from said initial probability (p in ) and the suitable 
mathematical expression of the discriminating threshold (S); 

• mapping the elements classified as noise into corresponding elements to be zeroed of said full-dimensional 
space-time channel matrix (H ls ). 

13. Method for under-parameterizing the channel estimate according to the preceding claim, characterized in that: 

• the classification variable (z m w ) coincides with the module of any element of the whitened set (Z LS ); 

• the dimension of the classification set is M x W; 

• said statistical distribution is the Rayleigh distribution; 

• said mathematical expression which minimizes the probability of uncorrect classification is the following: 
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where: S is said discriminating threshold, is the variance of the useful subset (H,), o*/is the variance of the 
noise subset (H 0 ), and p is the probability that an element of the classification set belong to the useful subset (H,). 

14. Method for under-parameterizing the channel estimate according to claim 12 when it depends on claim 9, char- 
acterized in that: 

• the classification variable (z m wn ^ coincides with the module of any element of the NS sequential whitened 
set(Z ts ); 

• the dimension of the classification set is M x W x NS. 

• said statistical distribution is the Rayleigh distribution; 

• said mathematical expression which minimizes the probability of uncorrect classification is the following: 



v-l 
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tf 2 



where: S is said discriminating threshold, a* is the variance of the useful subset (H^, is the variance of the 
noise subset (H 0 ), and p is the probability that an element of the classification set belong to the useful subset (Hj). 

15. Method for under-parameterizing the channel estimate according to claim 12 when it depends on claim 9, char- 
acterized in that: 

• the classification variable (d m w ) is obtained summing up the square module of NS corresponding elements 
upon the NS sequential whitened set (Z LS ); 

• the dimension of the classification set is M x W; 

• said statistical distribution is the Erlang distribution; 

• said mathematical expression which minimizes the probability of uncorrect classification is the following: 
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where: S is said discriminating threshold, a 2 is the variance of the useful subset, a 2 is the variance of the noise 
subset, and p is the probability that an element of the classification set belong to the useful subset (H^). 

16. Method for under-parameterizing the channel estimate according to claim 12 when it depends on claim 9, char- 
acterized in that: 

• the classification variable (d wn ^ is obtained summing upon the NS whitened set (Z ls ) the square module of 
elements relevant to the M antennas at the same discrete time instants; 

• the dimension of the classification set is W x NS; 

• said statistical distribution is the Erlang distribution; 

• said mathematical expression which minimizes the probability of uncorrect classification is the following: 
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where: S is said discriminating threshold, a 2 is the variance of the useful subset, o 2 is the variance of the noise 
subset, and p is the probability that an element of the classification set belong to the useful subset (HJ. 

17. Method for under-parameterizing the channel estimate according to claim 12 when it depends on claim 9, char- 
acterized in that: 



• the classification variable (d w ) is obtained summing up the square module of corresponding elements upon 
NS sequential whitened set (Z LS ), and still summing up the resulting elements relevant to the M antennas at 
the same discrete time instants; 

• the dimension of the classification set is W; 

• said statistical distribution is the Erlang distribution; 

• said mathematical expression which minimizes the probability of uncorrect classification is the following: 
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where: S is said discriminating threshold, a 2 is the variance of the useful subset, a* is the variance of the noise 
subset, and p is the probability that an element of the classification set belong to the useful subset (H,). 

18. Method for under-parameterizing the channel estimate according to any claim from 12 to 17, characterized in 
that said recursive estimate at step d) includes the following sub-steps of: 

d1 ) sorting by decreasing absolute values the elements of the classification set, tabled in association with their 
positional indexes inside the original set; 

d2) multiplying said initial probability (p /n ) by the dimension of the classification set obtaining the initial dimen- 
sion (L) of the useful subset (H^) which includes a number of higher sequential elements of the sorted set 
equal to said initial dimension (/.); 

d3) calculating the initial dimension of the complementary noise subset (H 0 ) including the remaining sequential 
elements of the sorted set; 

d4) estimating the variance (a 2 , a 2 ) of the useful and noise subset (H v Hq) by the respective dimensions; 

d5) calculating said mathematicaP expression for obtaining a value of said discriminating threshold (S); 

d6) classifying the elements of the sorted set taking as useful the elements whose squared module is greater 

than the threshold (S) and noise the others; 

d7) recalculating the dimensions of the useful and noise sorted set; 

d8) comparing the new dimension (L) of the useful subset {HJ with the preceding one and stopping the clas- 
sification if the two values are near the same, otherwise calculating a new probability (p) that an element of 
the sorted set belong to the useful subset (H,) and repeating the sub-steps from d4) to d8) cyclically until the 
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end condition is met; 

d9) addressing the elements of the full-dimensional space-time channel matrix (H LS ) to be considered as noise 
by the corresponding tabled positional indexes. 

5 19. Method for under-parameterizing the channel estimate according to any claim from 12 to 18, characterized in 
that it further includes the following sub-steps of: 

• subtracting from the variance (a* ) of the useful subset (H, ) the squared value of said discriminating threshold S; 

• calculating the probability p that an element of the sorted set belongs to the useful subset (Hj ) by the following 
10 expression: 



15 



P=- 



-L-e 2o ° 

2o? 2ol 



20 where: L is the dimension of the useful set (H ^)]L FD is the dimension of the classification set; o* and a* are the 

variance respectively of the useful and noise subset (H lf H 0 ). 

20. Method for under-parameterizing the channel estimate according to one of the preceding claims, characterized 
in that the rank of the whitened set (Z LS ) is reduced before the classification step d). 

25 

21 . Radiomobile Base Station receiver connected to a set of M antennas and carrying out the under-parameterization 
method of the claim 1 , including: 

a) channel estimation means (LS-CHEST, MID) for estimating the elements of a full-dimensional space-time 
30 channel model settled as a memory matrix table (H LS ) including the Wmost significant samples of the channel 

impulse response relevant to the M links insisting on an assigned carrier between one or more simultaneously 
active mobile stations and the /W-antennas; 

b) noise estimation means (SPn-ES) for estimating a spatial noise covariance matrix (R ns ) from the residual 
of the channel estimation, characterized in that further includes: 

35 c) filtering means (WST) arranged to use the spatial noise covariance matrix (R ns ) for filtering said full-dimen- 

sional space-time channel matrix (H LS ) making spatially uncorrected, or spatially whitened, the noise on the 
various channels, in order to describe the elements of said full-dimensional space-time channel matrix (H LS ) 
by means of a relevant statistical distribution of the amplitudes; 

d) classification means (SORTER, CLASS, pdf-STIM, ADAPT-TRSH) for dynamically classifying as useful 
4 o (HJ or noise (H 0 ) the elements of the whitened set (Z) using a discriminating threshold (S) statistically esti- 
mated; 

e) masking means (MASK, BAND-MASKING) generating a mask matrix (M sr , M S7B ) for zeroing the non-useful 
elements of the full-dimensional space-time channel matrix (H LS ) corresponding to the elements classified as 
noise in the whitened set (Z); 

45 f) data detecting means (SIGN-1 M, ZF-BLE DATA DETECTOR) arranged for using the masked space-time 

channel matrix (H STB ) for estimating the transmitted data. 

22. Radiomobile Base Station receiver according to claim 21, characterized in that said filtering means (WST) is 
arranged for temporal filtering said full-dimensional space-time channel matrix (H LS ) by means of an autocorrelation 

so matrix of one or more uncorrelated training sequences assigned to one or more users one to one. 

23. Radiomobile Base Station receiver according to claim 21 or 22, characterized in that it includes merging means 
(MERGING) acting on said mask matrix (M ST ) for zeroing a column as a whole if the number of corresponding 
column elements classified as useful is equal or less than a fixed percentage depending on the distance between 

55 adjacent antennas, obtaining a band-mask matrix (M sr0 ). 

24. Radiomobile Base Station receiver according to the preceding claim, characterized in that said merging means 
(MERGING) acts on said mask matrix (M ST ) in order to unmask (M STB ) all the elements relevant to sample-times 
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immediately preceding and following an element classified as useful, in order to match those channels introducing 
temporal correlation between adjacent impulses of the channel responses and neutralize possible unsynchroni- 
zations. 

5 25. Radiomobile Base Station receiver according to any claim from 21 to 24, characterized In that said classification 
means (SORTER, CLASS, pdf-STIM, ADAPT-TRSH) operates recursively for estimating said discriminating 
threshold (S) and the statistical parameters (a*, a*, p) the elements of whitened set belong to the useful or noise 
subset, starting from an arbitrarily assigned initial probability (p /n ) that an element of the whitened set (Z) belong 
to the useful subset (H t ). 

w 

26. Radiomobile Base Station receiver according to any claim from 21 to 25, characterized in that cascaded to said 
filtering means (WST) it further includes means (RR) for reducing the rank of said whitened set (Z). 

27. Radiomobile Base Station receiver according to any claim from 21 to 26, characterized in that said noise esti- 
15 mation means (SPn-ES) re-estimates the spatial noise covariance matrix (R ) by means of the masked space- 
time channel matrix (H $TB ). 5 

28. Radiomobile Base Station receiver according to the preceding claim when it depends on claim 23, characterized 
in that said data detecting means (ZF-BLE DATA DETECTOR) includes M convolutional modules (SIGN-1 , 

20 SIGN-M) relevant to as many incoming band-masked sub-matrixes (H* m) ), the rrt h sub-matrix being associated 

to the estimates of K users originated from an m* h antenna, for generating an overall signature matrix (A S7B ) having 
a generalized band structure suitable for joint detecting /(data bursts transmitted from K simultaneously active 
users. 

25 29. Radiomobile Base Station receiver according to the preceding claim, characterized in that each convolutional 
module (SIGN-1 , SIGN-M) includes as many replicas of the convolutional means (CONV-1, CONV-N) as 
the N data chips of the transmission burst. 

30. Radiomobile Base Station receiver according to the preceding claim, characterized in that that each replica of 
30 the convolutional means (CONV-1, CONV-N) performs the convolution between the W estimates of the k? h 

active user stored in said incoming band-masked sub-matrix (H* m) ) and a k* h spreading sequence (c^ belonging 
to K orthogonal spreading sequences assigned to the users one$>one, obtaining a signature vector (bn."V,b' 2 ' m >, 
*: t yKmfy having the same band structure as the masked space-time channel matrix (H S78 ) relevant to said user. 

35 
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COST-259 Channel Typologies 
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Channel matrix for single user and M antennas 
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DISTRIBUTION FUNCTIONS OF USEFUL 
CHANNEL CHIPS 
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DEGREES OF FREEDOM DISTRIBUTION 
FUNCTIONS (Fig. 10) 
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COMPARISON AMONG NORMALIZED MSE 
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Channel model for single /ca user and single m th antenna 
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Midamble matrix for K users and single m th antenna 
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POSSIBLE DISTRIBUTIONS FOR 
NOISE AND USEFUL SAMPLES 
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POLARIZATION OF THE ESTIMATED VARIANCE OF THE OVER 
THRESHOLD SAMPLES FOR THE TWO CLASSIFIERS 
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5-th VARIANT (extended-multislot & compact-antennas) 

Dimension of the resulting statistical set = WxNS; pdf = Erlang 
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